materials for modulating the interaction between HER3 and heregulin.






WO 03/011897 



PCT/US02/23963 



METHODS AND MATERIALS FOR CHARACTERIZING AND 
MODULATING INTERACTION BETWEEN HEREGULIN AND HER3 

RELATED APPLICATIONS 
This application claims priority under Section 119(e) from U.S. Provisional 
Application Serial No. 60/308,431 filed July 27, 2001, the contents of which are 
incorporated herein by reference. 
FIELD OF THE INVENTION

This invention relates generally to HER3 compositions, and to methods of using
HER3 compositions to modulate the interaction between heregulin and HER3 and, for
example, the biological activities of heregulin and HER3. The invention also relates to
methods of identifying molecules that bind to the heregulin binding site of HER3. The
invention also relates to methods for in vitro, in situ, and/or in vivo diagnosis and/or
treatment of mammalian cells or pathological conditions associated with HER3 and
heregulin.

BACKGROUND OF THE INVENTION 
Cancers are the second most prevalent cause of death in the United States,
causing 450,000 deaths per year. One in three Americans will develop cancer, and one in
five will die of cancer. While substantial progress has been made in identifying some of
the likely environmental and hereditary causes of cancer, there is a need for substantial
improvement in the diagnosis and therapy for cancer and related diseases and disorders.

A number of so-called cancer genes, i.e., genes that have been implicated in the
etiology of cancer, have been identified in connection with hereditary forms of cancer
and in a large number of well-studied tumor cells. Cancer genes are broadly classified
into "oncogenes" which, when activated, promote tumorigenesis, and "tumor suppressor
genes" which, when damaged, fail to suppress tumorigenesis. While these classifications
provide a useful method for conceptualizing tumorigenesis, it is also possible that a
particular gene may play differing roles depending upon the particular allelic form of that
gene, its regulatory elements, the genetic background and the tissue environment in
which it is operating. Human epidermal growth factor receptor 3 is one of these genes
implicated in the etiology of cancer (see, e.g. Munster et al., Cancer Res. 2002 Jun
1;62(11):3132-7; Menard et al., J Cell Physiol 2000 Feb;182(2):150-62; Basso et al.,
Oncogene 2002 Feb 14;21(8):1159-66; and Yarden, Oncology 2001;61 Suppl 2:1-13).

Human epidermal growth factor receptor 3 (HER3) (see, e.g. Kraus et al., (1989)
Proc. Natl. Acad. Sci. U. S. A. 86, 9193-9197) is a member of the type I receptor tyrosine
kinase (RTK) family, which also includes EGFR, HER2/neu, and HER4 (see, e.g. Ullrich
et al., (1984) Nature 309, 418-425; Schechter et al., (1985) Science 229, 976-978;
Plowman et al., (1993) Proc. Natl. Acad. Sci. U. S. A. 90, 1746-1750). HER3 forms
heterodimers with other members of the type I RTK family, including the HER2/neu
receptor (see, e.g. Sliwkowski et al., (1994) J. Biol. Chem. 269, 14661-14665; Fitzpatrick
et al., (1998) FEBS Lett. 431, 102-106; Heldin, C. H. (1995) Cell 80, 213-223; Tzahar et
al., (1997) EMBO J. 16, 4938-4950). The HER2/neu receptor is amplified and
overexpressed in 25-30% of human breast and 8-11% of human ovarian cancers. This
overexpression correlates with increased morbidity and mortality, and there is evidence
that the overexpressed HER2 receptor leads to aggressive malignancies (see, e.g. Slamon
et al., (1987) Science 235, 177-182; Slamon et al., (1989) Science 244, 707-712; Plowman
et al., (1993) Nature 366, 473-475; Dougall et al., (1996) DNA Cell Biol. 15, 31-40).

Cells expressing HER2 receptors alone, and not other members of the
EGFR family, fail to bind heregulin, but HER2/neu has high tyrosine kinase activity.
HER3 is a kinase defective receptor (see, e.g. Guy et al. (1994) Proc Natl Acad Sci U S A
91(17), 8132-6), but has binding affinity for heregulin (see, e.g. Carraway et al. (1994) J
Biol Chem 269(19), 14303-6). The HER2/HER3 heterodimer forms a high affinity
heregulin receptor with tyrosine kinase activity. Heregulin binding to cells that display
the HER2/HER3 heterodimer causes a mitogenic response both in vitro and in vivo, so
understanding this interaction is of medical importance (see, e.g. Aguilar et al., (1999)
Oncogene 18, 6050-6062; Sliwkowski et al. (1994) J Biol Chem 269(20), 14661-5; Heldin,
C. H. (1995) Cell 80(2), 213-23; Tzahar et al. (1997) Embo J 16(16), 4938-50). Alternate
transcripts of HER3 isolated from an ovarian carcinoma-derived cell line have been
identified which encode truncated forms of the extracellular domain of HER3, including
three clones where the protein products were soluble secreted proteins (see, e.g. Lee, H.,
and Maihle, N. J. (1998) Oncogene 16(25), 3243-52). A naturally occurring secreted form
of HER3 has been found to inhibit heregulin-stimulated activation of HER3 (see, e.g.
Lee et al. (2001) Cancer Res 61(11), 4467-73). This provides evidence that HER3 could be
an important target in breast cancer therapy.

Type I receptor tyrosine kinases typically contain four extracellular domains, a
single hydrophobic transmembrane segment, and a cytoplasmic tyrosine kinase domain
(see, e.g. Ullrich, A., and Schlessinger, J. (1990) Cell 61, 203-212). HER2/neu is a very
active tyrosine kinase, but cells expressing HER2/neu alone, and not other members of
the EGFR family, fail to bind heregulin. Conversely, the HER3 receptor binds heregulin
but has low tyrosine kinase activity (see, e.g. Guy et al., (1994) Proc. Natl. Acad. Sci. U. S.
A. 91, 8132-8136; Carraway et al., (1994) J. Biol. Chem. 269, 14303-14306). As noted
above, the HER2/HER3 heterodimer is a high affinity heregulin binding complex with
signaling activity through the HER2 kinase domain. To date, the domains of HER3
involved in ligand binding and heterodimerization have not been identified.

Thus far, the high carbohydrate content (see, e.g. Horan et al., (1995) J. Biol.
Chem. 270, 24604-24608) and the relatively large size (~180 kDa) of the receptors in the
EGFR family have frustrated structural analysis by x-ray crystallography and NMR, so
other methods have been sought to illuminate the structure and function of HER3. The
extracellular domains (ECDs) of the type I RTKs have been divided into four domains:
I, II, III, and IV, based on sequence analysis (see, e.g. Yarden, Y., and Ullrich, A. (1988)
Annu. Rev. Biochem. 57, 443-478). Domains II and IV are cysteine-rich and are similar
in sequence. Domains I and III also have sequence similarity (see, e.g. Yarden, Y., and
Ullrich, A. (1988) Annu. Rev. Biochem. 57, 443-478; Lax et al., (1988) Mol. Cell. Biol. 8,
1970-1978). Little is known about the specific function of each domain except in EGFR,
where several lines of evidence indicate that the major determinants for EGF
binding lie in domain III. These lines of evidence include the following: 1) the exchange
of domain III in chicken EGFR for domain III from human EGFR confers binding of
human EGF (see, e.g. Lax et al., (1989) EMBO J. 8, 421-427; Lax et al., (1991) Cell
Regul. 2, 337-345); 2) monoclonal antibodies that recognize residues in domain III
prevent EGF binding to EGFR (see, e.g. Wu et al., (1989) J. Biol. Chem. 264, 17469-17475);
3) cross-linking of EGF to EGFR identified residues in domain III (see, e.g.
Summerfield et al., (1996) J. Biol. Chem. 271, 19656-19659; Wu et al., (1990) Proc. Natl.
Acad. Sci. U. S. A. 87, 3151-3155); and 4) limited proteolysis of the ECD of EGFR
produced a fragment that encompassed domain III, which bound transforming growth
factor α, with the observation that binding could be enhanced by including portions of
domain IV (see, e.g. Kohda et al., (1993) J. Biol. Chem. 268, 1976-1981). Additional
studies from cross-linking experiments indicated that bound EGF is also close to
tyrosine 101 in domain I of murine EGFR (see, e.g. Woltjer et al., (1992) Proc. Natl.
Acad. Sci. U. S. A. 89, 7801-7805). Taken together, these experimental results provide
evidence that domains I and III are close to the ligand-binding region in EGFR and that
domain III contributes most of the binding. HER3 and EGFR have relatively high
sequence identity (45% identity in the ECD) and belong to the same family of type I
tyrosine kinase receptors; however, they bind a different subset of ligands and differ in
preference for heterodimerization versus homodimerization (see, e.g. Tzahar et al., (1997)
EMBO J. 16, 4938-4950; Lemmon et al., (1997) EMBO J. 16, 281-294; Huang et al.,
(1998) Biochem. J. 331, 113-119; Alimandi et al., (1995) Oncogene 10, 1813-1821).

While the existing art provides a limited understanding of the structure of HER3
and the interaction between heregulin and HER3, this art does not delineate the domains
in HER3 responsible for interacting with heregulin. Consequently there is a need in the
art for the identification and characterization of the domains in HER3 involved in this
interaction so that methods and materials for modulating this interaction can be
generated. The disclosure provided herein meets this need.

SUMMARY OF THE INVENTION
The disclosure provided herein identifies and characterizes the domains in the
HER3 receptor that interact with the heregulin ligand. In this context, the present
invention provides antagonists or agonists of the HER3 receptor such as soluble HER3
variants and methods for using them. The antagonists and agonists described herein find
utility for, among other things, in vitro, in situ, or in vivo diagnosis or treatment of
mammalian cells or pathological conditions associated with the aberrant expression of
heregulin and/or HER3.

A preferred embodiment of the invention is a HER3 variant polypeptide
comprising an amino acid sequence which differs from the native sequence HER3
polypeptide sequence of SEQ ID NO: 2 and has one or more amino acid
substitutions at the following residue position(s) in SEQ ID NO: 2: E43; N44; K51; E64;
V66; and V110. Another embodiment of the invention is a non-naturally occurring
HER3 variant polypeptide consisting essentially of amino acid 20 to amino acid 329 of
HER3 polypeptide sequence of SEQ ID NO: 2; wherein the HER3 variant polypeptide
specifically binds to the heregulin polypeptide of SEQ ID NO: 4 and exhibits an
impaired ability to interact with the HER2 polypeptide of SEQ ID NO: 6.

Related embodiments of this invention include an isolated nucleic acid
comprising a nucleotide sequence encoding these HER3 variants, vectors comprising
nucleic acids encoding these variants and host cells (e.g. E. coli) containing these vectors.
Another related embodiment of the invention is a method of making HER3 variant
polypeptide by providing a host cell comprising a vector encoding a HER3 variant,
providing culture media, culturing the host cell in the culture media under conditions
sufficient to express the HER3 variant polypeptide, recovering the HER3 variant
polypeptide from the host cell or culture media, and then purifying the HER3 variant
polypeptide. Optionally the HER3 variant polypeptide is conjugated or linked to one or
more polyol groups.






The invention also provides methods of conducting screening assays to identify
candidate molecules, such as small molecule compounds, polypeptides or antibodies,
which act as agonists or antagonists with respect to the interaction between heregulin
and HER3. One embodiment of the invention is a method of identifying a compound
which specifically binds the heregulin binding domain in a HER3 variant polypeptide
comprising amino acid 20 to amino acid 329 of HER3 polypeptide sequence of SEQ ID
NO: 2 comprising contacting the HER3 variant polypeptide with a test compound under
conditions favorable to binding and determining whether the test compound specifically
binds to the HER3 variant polypeptide such that a compound which binds to the HER3
variant polypeptide can be identified. Optionally this method can further comprise
determining whether the test compound inhibits the heregulin-induced tyrosine kinase
activity by contacting mammalian cells that express HER3 receptor with heregulin in the
presence and absence of the test compound and then monitoring the mammalian cells
for the tyrosine kinase activity associated with the HER2/HER3 polypeptide complex,
wherein an inhibition in tyrosine kinase activity in the presence of the test compound as
compared to the absence of the test compound identifies the test compound as an
inhibitor of heregulin-induced tyrosine kinase activity. Alternatively, the method can
further comprise determining whether the test compound enhances the heregulin-induced
tyrosine kinase activity by contacting mammalian cells that express HER3
receptor with heregulin in the presence and absence of the test compound and then
monitoring the mammalian cells for the tyrosine kinase activity associated with the
HER3 polypeptide, wherein an increase in tyrosine kinase activity in the presence of the
test compound as compared to the absence of the test compound identifies the test
compound as an enhancer of heregulin-induced tyrosine kinase activity.

Yet another embodiment of the invention is a method of determining whether a
test compound modulates the interaction between heregulin polypeptide of SEQ ID
NO: 2 and a HER3 variant polypeptide comprising amino acid 20 to amino acid 329 of
SEQ ID NO: 2 by contacting the HER3 variant polypeptide with a test compound under
conditions favorable to binding, contacting the HER3 variant polypeptide with heregulin
under conditions favorable to binding and then comparing the binding interaction
between the HER3 variant polypeptide and heregulin with the binding interaction
between HER3 variant polypeptide and heregulin in the absence of the test compound
such that a compound which modulates the interaction between heregulin and the HER3
variant polypeptide can be identified.
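
The comparison step in this method can be illustrated with a minimal sketch. It assumes that binding is read out as a single positive signal in arbitrary units, once without and once with the test compound. The function name, the readout values, and the 20% cutoff are hypothetical choices made for illustration only; they are not taken from the disclosure.

```python
def classify_modulation(signal_without: float,
                        signal_with: float,
                        threshold: float = 0.2) -> str:
    """Classify a test compound from two binding readouts (arbitrary units).

    signal_without: heregulin/HER3 variant binding without the test compound.
    signal_with:    the same readout with the test compound present.
    threshold:      hypothetical cutoff on the relative change (assumption).
    """
    if signal_without <= 0:
        raise ValueError("control binding signal must be positive")
    # Relative change of the binding signal caused by the test compound.
    relative_change = (signal_with - signal_without) / signal_without
    if relative_change <= -threshold:
        return "inhibitor"
    if relative_change >= threshold:
        return "enhancer"
    return "no effect"


if __name__ == "__main__":
    # Illustrative numbers only, not experimental data.
    print(classify_modulation(100.0, 40.0))
```

In practice the cutoff would be set from the variability of replicate control measurements rather than fixed in advance.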

Another embodiment of the invention is a method of inhibiting the interaction
between a heregulin polypeptide having the sequence shown in SEQ ID NO: 4 and
HER3 polypeptide having the sequence shown in SEQ ID NO: 2 comprising exposing
the heregulin polypeptide to a non-naturally occurring HER3 variant polypeptide
comprising domains I and II of the HER3 polypeptide sequence of SEQ ID NO: 2;
wherein the HER3 variant polypeptide specifically binds to the heregulin polypeptide of
SEQ ID NO: 4; and exhibits an impaired ability to interact with the HER2 polypeptide
of SEQ ID NO: 6. In preferred embodiments of the invention, the HER3 variant
polypeptide has an amino acid substitution V110 in SEQ ID NO: 2.

Preferred embodiments of the invention also include anti-HER3 antibodies 
which are capable of specifically binding to the heregulin binding domain. In a preferred 
embodiment, the invention provides antibodies which specifically bind to a HER3 
polypeptide and inhibit the binding of heregulin ligand to the HER3 receptor. 
Optionally, the antibodies are monoclonal antibodies.

In one embodiment of the invention, there are provided methods for the use of
heregulin antagonists to block or neutralize the interaction between heregulin and HER3.
For example, the invention provides a method comprising exposing a mammalian cell,
such as an ovarian or breast cell, to one or more heregulin antagonists in an amount
effective to decrease, neutralize or block activity of the heregulin ligand. The cell may be
in cell culture or in a mammal, e.g. a mammal suffering from, for instance, cancer. Thus,
the invention includes a method for treating a mammal suffering from a pathological
condition such as cancer comprising administering an effective amount of one or more
heregulin antagonists, as disclosed herein.

In addition, the invention provides methods of using HER3 agonists (e.g. anti-HER3
agonist antibodies which target the heregulin binding domain) to, for instance,
stimulate or activate HER3 receptor. Such methods will be useful in treating
pathological conditions characterized by or associated with insufficient HER3 expression
or activity. The HER3 agonists typically comprise agonistic anti-HER3 antibodies or
HER3 variant polypeptides. The agonistic activity of such HER3 agonists may comprise
enhancing the activity of a native ligand for HER3 or activity which is the same as or
substantially the same as (i.e., mimics) the activity of a native ligand for HER3. Typical
methods of the invention include methods to treat pathological conditions or diseases in
mammals associated with or resulting from increased or enhanced HER3 or heregulin
expression and/or activity. In the methods of treatment, HER3 variant polypeptides
may be administered which preferably block or reduce the respective receptor binding or
activation by heregulin.

The invention also provides compositions which comprise one or more HER3 or
heregulin agonists or antagonists. Optionally, the compositions of the invention will
include pharmaceutically acceptable carriers or diluents. Preferably, the compositions
will include one or more HER3 or heregulin antagonists or agonists in an amount which
is therapeutically effective to treat a pathological condition or disease. Optionally, the
compositions will include one or more HER3 agonists in an amount which is
therapeutically effective to stimulate signal transduction by HER3.

The invention also provides articles of manufacture and kits which include one 
or more HER3 or heregulin antagonists or agonists. 




BRIEF DESCRIPTION OF THE DRAWINGS 
The file of this patent contains at least one drawing executed in color. Copies of 
this patent with color drawing(s) will be provided by the Patent and Trademark Office
upon request and payment of the necessary fee. 
Fig. 1. A, limited proteolysis of HER3-ECD I-IV generates two detectable
fragments, which remain associated under oxidizing conditions. HER3-ECD I-IV was
digested in the presence of trypsin (T) and analyzed in the Phast gel in the presence of
the reducing agent β-ME. The proteolytic digest of HER3-ECD I-IV was analyzed on an
SDS-PAGE Phast gel (10-15%) in the absence (lane 4) and the presence (lane 3) of the
reducing agent β-ME. Nonproteolyzed HER3-ECD I-IV in the presence of β-ME is
shown in lane 2. The two fragments in proteolyzed HER3-ECD I-IV remain linked by one
or more disulfide bridges. B, a V5 antibody protects HER3-ECD I-IV from cleavage at the
C terminus. When HER3-ECD I-IV is proteolyzed in the absence of the V5 antibody, the
C terminus is digested and cannot be visualized when probed for the V5 epitope. This
gel shows HER3-ECD I-IV proteolyzed for the time indicated in the presence of an
antibody against the V5 epitope tag, analyzed on a 4-15% polyacrylamide gradient gel,
and transferred to a polyvinylidene difluoride membrane. The C-terminal fragment was
identified by Western blotting using an antibody against the C-terminal V5 epitope. The
V5 antibody protects the C terminus against proteolytic cleavage and shows that
fragment 2 is the C-terminal fragment.

Fig. 2. A, nonproteolyzed HER3-ECD I-IV binds hrg, but proteolyzed HER3-ECD I-IV
does not. The presence (+) or absence (-) of trypsin in the proteolytic digest is
indicated above the lanes. HER3-ECD I-IV was analyzed on a native Phast gel (10-15%)
in the absence (lane 1) and presence (lane 3) of equimolar hrg. Proteolyzed HER3-ECD I-IV
was also analyzed in the absence (lane 2) and presence (lane 4) of hrg. Formation of a
hrg-HER3-ECD I-IV complex is observed with the nonproteolyzed HER3-ECD I-IV but
not with the proteolyzed HER3-ECD I-IV. B, nonproteolyzed HER3-ECD I-IV is eluted by
hrg in a pull-down assay, but proteolyzed HER3-ECD I-IV is not. HER3-ECD I-IV was
applied untreated (-) or digested in the absence (T) or presence of hrg (T + h) to agarose-bound
hrg. HER3-ECD I-IV was detected with an antibody with an epitope in fragment 1.
HER3-ECD I-IV was bound to immobilized hrg and could be eluted with free hrg (lane 3)
and detergent (lane 6), but proteolyzed HER3-ECD I-IV could not be eluted with hrg
(lane 4) or detergent (lane 7). HER3-ECD I-IV that was protected by hrg during
proteolytic digestion could be recovered by both hrg (lane 5) and detergent (lane 8).
Trypsin that was treated with phenylmethylsulfonyl fluoride prior to addition to the
digestion reaction (*) did not abolish hrg binding by HER3-ECD I-IV, which could then be
recovered by hrg (lane 9). This demonstrates that proteolyzed HER3-ECD I-IV does not
bind hrg.

Fig. 3. A, limited proteolysis of HER3-ECD I-IV in the presence of hrg
generates a different cleavage pattern than in the absence of hrg when analyzed on an
SDS-PAGE Phast gel (10-15%) under reducing conditions. HER3-ECD I-IV was
proteolyzed in the absence of ligand (-) and in the presence of hrg, NA5-hrg, and EGF,
as indicated above the lanes. Limited proteolysis of HER3-ECD I-IV generates two
fragments: one at an apparent molecular mass of 24.2 kDa (fragment 1) and another at
54.1 kDa (fragment 2). In the presence of NA5-hrg (lane 4) and EGF (lane 5), fragment 1
appears to be identical to fragment 1 in the absence of hrg (lane 2), whereas in the
presence of wild type hrg (lane 3), fragment 1 appears to have a slightly higher molecular
mass. The digest of HER3-ECD I-IV in the presence of a binding-competent hrg
generates a different cleavage pattern than in its absence. Apparently, the cleavage that
generates fragment 1 can be protected by hrg and is due to a specific interaction between
hrg and HER3-ECD I-IV. B, the minimum amount of hrg required to produce a change in
the molecular weight of fragment 1 in the SDS gel was determined by varying the molar
ratio of hrg to HER3-ECD I-IV. The ratio of hrg to HER3-ECD I-IV was varied from 2.0
to 0 (lanes 2-8), as indicated above the lanes. A change in the molecular weight of
fragment 1 was observed when the molar ratio of hrg to HER3-ECD I-IV was between
0.9:1 and 1:1. Complete protection of HER3-ECD I-IV by hrg was observed at an
approximately 1:1 ratio of hrg to HER3-ECD I-IV.

Fig. 4. Domain organization and proteolytic cleavage sites of HER3-ECD I-IV.
The extracellular region of HER3 contains domains I, II, III, and IV. Sequence positions
are indicated at the start of each domain. The beginning of HER3-ECD I-IV at position 20
refers to the start of mature HER3-ECD I-IV after cleavage of the leader sequence. The
various digests that were analyzed are shown to the left: the digest of HER3-ECD I-IV in
the presence (+) and absence (-) of hrg and mass spectrometry of the oxidized (-βME)
and reduced (+βME) forms of the digest. The position of the cleavage site as determined
by N-terminal sequencing is indicated at the start of each fragment. Fragment 1 is
indicated by a gray bar, and fragment 2 is represented by a black bar. Numbers in
brackets are mass estimates in kDa as determined by mass spectrometry. S-S indicates
that the two fragments are held together by a disulfide bond. The exact position of the
cleavage at the C terminus is unknown, and this is indicated by a dashed bar. The
postulated mass of the missing fragments is indicated in brackets below the fragments
with dashed lines. In summary, hrg protects HER3-ECD I-IV from cleavage at position 50
but not at any other site.

Fig. 5. HER3-ECD I-IV binds hrg but not EGF or the binding-deficient NA5-hrg.
HER3-ECD I-IV was analyzed on a native Phast gel (10-15%) in the absence of hrg
(lane 1) and in the presence of hrg (lane 2), NA5-hrg (lane 3), and EGF (lane 4). The
ligands were used at a 1:1 ratio of ligand to HER3-ECD I-IV. A complete shift is observed in
the presence of hrg, and no shift is apparent with NA5-hrg or EGF.

Fig. 6. Characterization of recombinant HER3-ECD I-II by SDS-PAGE
analysis, mass spectrometry, light scattering, and equilibrium centrifugation. A,
recombinant HER3-ECD I-II is pure as judged by SDS-PAGE analysis. HER3-ECD I-II
was expressed in Drosophila S2 cells and purified from the medium. The molecular mass
appears to be somewhat less than 40 kDa. B, the molecular mass of HER3-ECD I-II is
between 38.5 and 40 kDa as determined by MALDI mass spectrometry, compared with a
theoretical molecular mass of 37.7 kDa. This provides evidence that HER3-ECD I-II has a
carbohydrate content between 3.3 and 5.7% by weight. C, recombinant HER3-ECD I-II is
a monomer as determined by size exclusion chromatography followed by multiangle light
scattering. Recombinant HER3-ECD I-II eluted in a single peak (black line) with a
molecular mass of ~34 kDa (S.E. = ±4%) (black circles). The addition of a 1.3-fold
molar excess of hrg caused a slight shift in the peak (gray line). The shifted peak has a
molecular mass of 35 kDa (S.E. = ±3%) (gray squares). Recombinant HER3-ECD I-II is a
monomer and remains a monomer in the presence of hrg. D, recombinant HER3-ECD I-II
is a monomer, as determined by ultracentrifugation at three different concentrations
and three different speeds. The average molecular mass of HER3-ECD I-II is 35.0 kDa
(S.E. = ±0.6%), which corresponds to monomeric HER3-ECD I-II.

Fig. 7. Recombinant HER3-ECD I-II binds hrg. A, HER3-ECD I-II was analyzed
in a pull-down assay in which S-tagged hrg was immobilized on S-protein resin (lanes 3-5)
and could be specifically eluted with 1 µM hrg (lane 4) but not PBS (lane 3) or 1 µM
EGF (lane 5). The fragment showed little nonspecific binding to S-protein resin without
immobilized S-tagged hrg (lanes 1 and 2) when eluted with PBS (lane 1) or 1 µM hrg
(lane 2). This shows that domains I and II of HER3-ECD I-IV are involved in hrg binding.
B, recombinant HER3-ECD I-II has a Kd of 68 nM as determined by SPR (BIAcore).
HER3-ECD I-II showed binding to immobilized trx-hrg with a calculated equilibrium
dissociation constant of 68 nM using the three different concentrations of HER3-ECD I-II
indicated. This shows that HER3-ECD I-II binds hrg.
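
To give a sense of what a Kd of 68 nM implies, the following minimal sketch computes the expected equilibrium occupancy of immobilized hrg at a given HER3-ECD I-II concentration. It assumes a simple 1:1 binding model in which the free analyte concentration is approximately the total concentration. The function name and the example concentrations are hypothetical and are not the concentrations used in the figure.

```python
KD_NM = 68.0  # equilibrium dissociation constant reported above, in nM


def fraction_bound(analyte_nm: float, kd_nm: float = KD_NM) -> float:
    """Fraction of immobilized ligand occupied at equilibrium.

    Assumes a simple 1:1 binding model: occupancy = [A] / ([A] + Kd),
    with the free analyte concentration taken as the total concentration.
    """
    if analyte_nm < 0:
        raise ValueError("analyte concentration must be non-negative")
    if kd_nm <= 0:
        raise ValueError("Kd must be positive")
    return analyte_nm / (analyte_nm + kd_nm)


if __name__ == "__main__":
    # Illustrative concentrations only (nM).
    for conc in (6.8, 68.0, 680.0):
        print(f"{conc:7.1f} nM -> {fraction_bound(conc):.3f} of sites occupied")
```

Under this model, half of the binding sites are occupied when the analyte concentration equals the Kd.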

Fig. 8. Sequence alignment of domain I of HER3-ECD I-IV, IR, and IGF-1R
indicates the position of the proteolysis site (T) in domain I of HER3-ECD I-IV relative
to mutations (*) that decrease ligand binding in the IR. A naturally occurring mutation
that decreases binding in IR is indicated by an open circle. The "hormone binding
footprint" in the IGF-1R structure (43) is indicated by shaded areas in the sequence. This
alignment shows that the proteolysis site in HER3-ECD I-IV that is protected by hrg
corresponds to the analogous putative ligand-binding region in domain L1 in IGF-1R.

Fig. 9. Structural similarity of IGF-1R to HER3-ECD I-IV. A, space-filling
model of domains I and II of the structure of IGF-1R (43) showing the positions of the
proteolysis sites (red) observed for the HER3-ECD I-IV mapped to the IGF-1R structure.
The positions of mutations (dark blue) that decrease ligand binding in IR are also mapped
to a binding footprint in the IGF-1R structure. Domain L1 is light blue, and L2 is
lavender in IGF-1R (domains I and III in HER3-ECD I-IV), and domain S1 (domain II in
HER3-ECD I-IV) is green. 1740 Å² of accessible surface area in domain L1 are buried by
domain S1 in IGF-1R. The cleavage site at position 50 in HER3-ECD I-IV that is
protected by hrg (position 18 in IGF-1R) lies on the same face of domain L1 in IGF-1R
and is 11 Å away from residue 10 in the binding footprint of IR/IGF-1R. The
unprotected cleavage site at position 270 in HER3-ECD I-IV (position 237 in IGF-1R)
lies in domain II, is on the opposite side from the binding footprint, and is 40 Å away from
the binding footprint. Residue numbers are shown in the binding footprint. The cleavage
site that is protected by hrg in HER3-ECD I-IV is near the binding footprint in domain L1
of IGF-1R. B, backbone representation of domain II of IGF-1R showing the position of
the proteolysis site at position 270 in HER3-ECD I-IV (red cpk) (position 237 in IGF-1R)
superimposed on the IGF-1R structure. The chain remains connected by a disulfide (red
ball and stick) when cleavage occurs at position 270. The disulfide bridge connecting
fragments 1 and 2 of HER3-ECD I-IV is consistent with a structural model based on
IR/IGF-1R.

Fig. 10. A, HER3 I-II and HER3 I-II E64A were analyzed in a "pull down assay" in
which S-tagged hrg was immobilized on S-protein resin (lanes 3-6) and HER3 I-II could
be specifically eluted with 1 µM hrg and detergent (lanes 3 and 5), but HER3 I-II E64A was
not eluted by either 1 µM hrg or detergent (lanes 4 and 6). Both HER3 I-II and HER3 I-II
E64A showed little non-specific binding to S-protein resin without immobilized S-tagged
hrg (lanes 1-2). This shows that HER3 I-II E64A does not bind hrg. B, Recombinant
HER3 I-II E64A had no detectable binding greater than 500 nM to the immobilized hrg on
the BIAcore chip while HER3 I-II showed binding to immobilized trx-hrg with a
calculated equilibrium dissociation constant of 68 nM. This shows that HER3 I-II E64A
does not bind hrg.

Fig. 11. Characterization of recombinant HER3 I-II E64A by mass spectrometry
and circular dichroism. A, the molecular mass of HER3 I-II E64A is between 38.4 and 40.6
kDa as determined by MALDI mass spectrometry, which is comparable to the molecular
mass of HER3 I-II (38.5 and 40 kDa). This provides evidence that the polypeptide chain
of HER3 I-II E64A is intact. B, the CD spectra of HER3 I-II and HER3 I-II E64A were essentially
identical between 210 nm and 260 nm, providing evidence that they have the same
secondary structure.

Fig. 12. A, HER3 I-IV binds hrg, but HER3 I-IV E64A does not. HER3 I-IV and the
HER3 I-IV mutants were analyzed on a native Phast gel (10-15%) in the absence (-) and
presence (+) of equimolar hrg. The HER3 I-IV E64A was also analyzed in the absence (lane
2) and presence (lane 4) of hrg. Formation of a hrg-HER3 I-IV complex is observed with
HER3 I-IV, but not with HER3 I-IV E64A. B, HER3 I-IV and HER3 I-IV E64A were analyzed in
a "pull down assay" in which S-tagged hrg was immobilized on S-protein resin (lanes 4-9)
and HER3 I-IV could be specifically eluted with 1 µM hrg and detergent (lanes 6 and 8),
but HER3 I-IV E64A could not be eluted with PBS or 1 µM hrg (lanes 5 and 7). A small
amount of HER3 I-IV E64A could be eluted with detergent (lane 9), which was probably due
to some binding or non-specific interaction. Both HER3 I-IV and HER3 I-IV E64A showed
little non-specific binding to S-protein resin without immobilized S-tagged hrg when
eluted with 1 µM hrg (lanes 2-3). This shows that HER3 I-IV E64A does not bind hrg.

Fig. 13. Space filling model of domains I, II, and III of the structure of IGF-1R
showing the positions of the protected proteolysis site (red) and residues E64 (yellow)
and V110 (green) in the HER3 I-IV mapped to the IGF-1R structure (see, e.g. Garrett et
al. (1998) Nature 394(6691), 395-9). The hormone binding footprint is in purple. E64
and V110 of HER3 I-IV are in or near the binding footprint in domain L1 of IGF-1R.

Fig. 14. Residues 64 and 110 could form specific interactions with heregulin. A,
Backbone representation of domain I of IGF-1R showing the position of E64 (yellow
cpk) and V110 (green cpk) (H30 and L81 in IGF-1R), the residues critical for heregulin
binding, and the protected proteolysis site at position 50 in HER3 I-IV (red cpk) (18 in
IGF-1R) superimposed on the IGF-1R structure. E64 (H30 in IGF-1R) and V110 (L81
in IGF-1R), which are critical for heregulin binding, are 12 Å from each other, and E64
(H30 in IGF-1R) is 13 Å from residue 50 (position 18 in IGF-1R), the protected cleavage
site. B, Backbone representation of the EGF-like domain of heregulin showing the
position of H178 (yellow cpk) and V180 (green cpk). HER3 V110 and hrg V180 could
form hydrophobic interactions. HER3 E64 and hrg H178 could form a salt bridge.
These specific interactions could be disrupted by the mutations of E64A and V110A in
HER3, as we observed in our biochemical experiments.

Fig. 15. The N-terminal residues H178 (yellow cpk) and V180 (green cpk) of hrg
can form interactions with the residues E64 (yellow cpk) and V110 (green cpk) of HER3,
and the omega loop of hrg could block the protected proteolysis site (residue 50, red
cpk) by sterically hindering interaction with trypsin.

Fig. 16. HER3 FL (residues 20-1342) binds heregulin, but HER3 FL E64A (residues
20-1342 with a mutation of E64A) and HER3 II-CTD (Domain I in figure, residues
184-1342) do not. A, Specific binding of 125I-hrg to COS7 cells transiently expressing
HER3 mutants. The FL HER3 shows a high level of specific binding while the HER3 FL
E64A and HER3 II-CTD show similar binding to cells expressing the vector only. B,
Western blot analysis of transient expression of the HER3 mutants in COS7 cells. The
WT (HER3 FL), E64A, and Domain I all express similar levels of protein.

DETAILED DESCRIPTION OF THE INVENTION 

Unless otherwise defined, all terms of art, notations and other scientific terms or
terminology used herein are intended to have the meanings commonly understood by
those of skill in the art to which this invention pertains. In some cases, terms with
commonly understood meanings are defined herein for clarity and/or for ready
reference, and the inclusion of such definitions herein should not necessarily be
construed to represent a substantial difference over what is generally understood in the
art. Many of the techniques and procedures described or referenced herein are well
understood and commonly employed using conventional methodology by those skilled
in the art, such as, for example, the widely utilized molecular cloning methodologies
described in Ausubel et al., Current Protocols in Molecular Biology, Wiley
Interscience Publishers, (1995) and Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2nd edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
N.Y. As appropriate, procedures involving the use of commercially available kits and
reagents are generally carried out in accordance with manufacturer defined protocols
and/or parameters unless otherwise noted.

Briefly, the disclosure provided herein identifies domains I-II of HER3 as the
major binding region for heregulin and identifies residue positions in HER3 that, when
mutated, produce heregulin binding polypeptides which have an altered affinity for this
ligand. Prior to this discovery, the region of ligand binding in HER3 was unknown.
This domain localization is unexpected in view of observations of EGFR (in which the
binding of its ligand occurs in domain III), a closely related receptor that has sequence
identity with HER3 and which belongs in the same family of type 1 kinase receptors. See,
e.g. Singer et al., JBC (2001), 276(47): 44266-44274. Various aspects and embodiments of
the invention disclosed herein are provided in the description below.

I. Abbreviations 

Abbreviations used include: HER, human epidermal growth factor receptor;
ECD, extracellular domain; EGF, epidermal growth factor; EGFR, epidermal growth
factor receptor; hrg, 60-residue EGF-like domain of human heregulin 1; IGF-1R,
insulin-like growth factor-1 receptor; IR, insulin receptor; NAs-hrg, binding-deficient
hrg mutant; PBS, phosphate-buffered saline; RTK, receptor tyrosine kinase; trx-hrg,
thioredoxin-heregulin fusion; β-ME, β-mercaptoethanol; PAGE, polyacrylamide gel
electrophoresis; BSA, bovine serum albumin; MALDI, matrix-assisted laser
desorption/ionization; SPR, surface plasmon resonance; MES, 4-morpholineethanesulfonic
acid.

II. Definitions 

The terms "HER3", "HER3 polypeptide" or "HER3 receptor" when used herein
encompass "native sequence HER3 polypeptides" and "HER3 variants" (which are
further defined herein). "HER3" is a designation given to those polypeptides which are
encoded by the nucleic acid molecules comprising the polynucleotide sequences shown
in SEQ ID NO: 1 and variants or fragments thereof, nucleic acid molecules comprising
the sequence shown in the SEQ ID NO: 1. The HER3 polynucleotides and
polypeptides of the invention may be isolated from a variety of sources, such as from
human tissue types or from another source, or prepared by recombinant and/or
synthetic methods. See, e.g. Plowman et al., PNAS (1990) 87: 4905-4909. As used
herein, the term "polypeptide" means an amino acid polymer of at least 6 amino acids.

A "native sequence" HER3 polypeptide comprises a polypeptide having the same
amino acid sequence as the corresponding HER3 polypeptide derived from nature. Such
native sequence HER3 polypeptides can be isolated from nature or can be produced by
recombinant and/or synthetic means. The term "native sequence HER3 polypeptide"
specifically encompasses naturally-occurring truncated or secreted forms (e.g., an
extracellular domain sequence), naturally-occurring variant forms (e.g., alternatively
spliced forms) and naturally-occurring allelic variants of the polypeptide. The HER3
polypeptides of the invention include the HER3 polypeptide comprising or consisting of
the contiguous sequence of amino acid residues 1 to 1342 of SEQ ID NO: 2.

A HER3 "extracellular domain" or "ECD" refers to a form of the HER3
polypeptide which is essentially free of the transmembrane and cytoplasmic domains.
Ordinarily, a HER3 polypeptide ECD will have less than about 1% of such
transmembrane and/or cytoplasmic domains and preferably, will have less than about
0.5% of such domains. It will be understood that any transmembrane domain(s)
identified for the HER3 polypeptides of the present invention are identified pursuant to
criteria routinely employed in the art for identifying that type of hydrophobic domain.
The exact boundaries of a transmembrane domain may vary but most likely by no more
than about 5 amino acids at either end of the domain as initially identified.

"HER3 variant" means a HER3 polypeptide having at least about 80% amino
acid sequence identity with the amino acid sequence of a native sequence full length
HER3 or HER3 ECD. Such HER3 variant polypeptides include, for instance, HER3
polypeptides wherein one or more amino acid residues are added, or deleted, at the N-
and/or C-terminus, as well as within one or more internal domains, of the full-length
amino acid sequence. Fragments of the HER3 ECD are also contemplated. Ordinarily,
a HER3 variant polypeptide will have at least about 80% amino acid sequence identity,
more preferably at least about 81% amino acid sequence identity, more preferably at least
about 82% amino acid sequence identity, more preferably at least about 83% amino acid
sequence identity, more preferably at least about 84% amino acid sequence identity, more
preferably at least about 85% amino acid sequence identity, more preferably at least
about 86% amino acid sequence identity, more preferably at least about 87% amino acid
sequence identity, more preferably at least about 88% amino acid sequence identity, more
preferably at least about 89% amino acid sequence identity, more preferably at least
about 90% amino acid sequence identity, more preferably at least about 91% amino acid
sequence identity, more preferably at least about 92% amino acid sequence identity, more
preferably at least about 93% amino acid sequence identity, more preferably at least
about 94% amino acid sequence identity, more preferably at least about 95% amino acid
sequence identity, more preferably at least about 96% amino acid sequence identity, more
preferably at least about 97% amino acid sequence identity, more preferably at least
about 98% amino acid sequence identity and yet more preferably at least about 99%
amino acid sequence identity with a HER3 polypeptide encoded by a nucleic acid
molecule shown in Figure 10 or a specified fragment thereof. HER3 variant
polypeptides do not encompass the native HER3 polypeptide sequence. Ordinarily,
HER3 variant polypeptides are at least about 10 amino acids in length, often at least
about 20 amino acids in length, more often at least about 30 amino acids in length, more
often at least about 40 amino acids in length, more often at least about 50 amino acids in
length, more often at least about 60 amino acids in length, more often at least about 70
amino acids in length, more often at least about 80 amino acids in length, more often at
least about 90 amino acids in length, more often at least about 100 amino acids in length,
more often at least about 150 amino acids in length, more often at least about 200 amino
acids in length, more often at least about 250 amino acids in length, more often at least
about 300 amino acids in length, or more.

The terms "heregulin" or "heregulin polypeptide" when used herein encompass
"native sequence heregulin polypeptides" and "heregulin variants". "Heregulin" is a
designation given to those polypeptides which are encoded by the nucleic acid molecules
comprising the polynucleotide sequences shown in SEQ ID NO: 3 and variants thereof,
nucleic acid molecules comprising the sequence shown in the SEQ ID NO: 3, and
variants thereof as well as fragments of the above which have the biological activity of
the native sequence heregulin. Variants of heregulin will preferably have at least 80%,
more preferably, at least 90%, and even more preferably, at least 95% amino acid
sequence identity with the native sequence heregulin polypeptide shown in SEQ ID NO:
4. A "native sequence" heregulin polypeptide comprises a polypeptide having the same
amino acid sequence as the corresponding heregulin polypeptide derived from nature.
Such native sequence heregulin polypeptides can be isolated from nature or can be
produced by recombinant and/or synthetic means.

The terms "HER2" or "HER2 polypeptide" when used herein encompass
"native sequence HER2 polypeptides" and "HER2 variants". "HER2" is a designation
given to those polypeptides which are encoded by the nucleic acid molecules comprising
the polynucleotide sequences shown in SEQ ID NO: 5 and variants thereof, nucleic acid
molecules comprising the sequence shown in SEQ ID NO: 5, and variants
thereof as well as fragments of the above which have the biological activity of the native
sequence HER2. Variants of HER2 will preferably have at least 80%, more preferably,
at least 90%, and even more preferably, at least 95% amino acid sequence identity with
the native sequence HER2 polypeptide shown in SEQ ID NO: 6. A "native sequence"
HER2 polypeptide comprises a polypeptide having the same amino acid sequence as the
corresponding HER2 polypeptide derived from nature. Such native sequence HER2
polypeptides can be isolated from nature or can be produced by recombinant and/or
synthetic means.

"Stringency" of hybridization reactions is readily determinable by one of ordinary 
skill in the art, and generally is an empirical calculation dependent upon probe length,
washing temperature, and salt concentration. In general, longer probes require higher
temperatures for proper annealing, while shorter probes need lower temperatures.
Hybridization generally depends on the ability of denatured DNA to re-anneal when
complementary strands are present in an environment below their melting temperature.
The higher the degree of desired identity between the probe and hybridizable sequence,
the higher the relative temperature which can be used. As a result, it follows that higher
relative temperatures would tend to make the reaction conditions more stringent, while
lower temperatures less so. For additional details and explanation of stringency of
hybridization reactions, see Ausubel et al., Current Protocols in Molecular Biology, Wiley
Interscience Publishers, (1995).

"Stringent conditions" or "high stringency conditions", as defined herein, are
identified by those that (1) employ low ionic strength and high temperature for washing,
for example 0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl
sulfate at 50°C; (2) employ during hybridization a denaturing agent, for example 50%
(v/v) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1%
polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium
chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M
NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium
pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 µg/ml), 0.1%
SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium
chloride/sodium citrate) and 50% formamide at 55°C, followed by a high-stringency wash
consisting of 0.1 x SSC containing EDTA at 55°C.

"Moderately stringent conditions" are identified as described by Sambrook et al.,
Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989,
and include the use of washing solution and hybridization conditions (e.g., temperature,
ionic strength and %SDS) less stringent than those described above. An example of
moderately stringent conditions is overnight incubation at 37°C in a solution comprising:
20% formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium
phosphate (pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/ml
denatured sheared salmon sperm DNA, followed by washing the filters in 1 x SSC at
about 37-50°C. The skilled artisan will recognize how to adjust the temperature, ionic
strength, etc. as necessary to accommodate factors such as probe length and the like.
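
By way of illustration only, the SSC strengths quoted in the hybridization and wash recipes above can be converted to molar concentrations with a short calculation. The Python sketch below assumes the conventional composition of 1 x SSC (0.15 M NaCl, 0.015 M sodium citrate), which is consistent with the 5 x SSC figures (0.75 M NaCl, 0.075 M sodium citrate) given for the stringent conditions; under that assumption, 150 mM NaCl and 15 mM trisodium citrate correspond to the 1 x composition. The names SSC_1X_MOLAR and ssc_molarity are hypothetical and are introduced only for this example.

```python
# Assumed composition of 1 x SSC in mol/L (conventional values; an assumption,
# not a definition taken from this disclosure).
SSC_1X_MOLAR = {"NaCl": 0.15, "sodium citrate": 0.015}


def ssc_molarity(fold: float) -> dict:
    """Return the molar concentration of each SSC solute at a given fold strength."""
    if fold <= 0:
        raise ValueError("fold strength must be positive")
    # Round to suppress floating-point noise in the scaled values.
    return {solute: round(conc * fold, 6) for solute, conc in SSC_1X_MOLAR.items()}


if __name__ == "__main__":
    # Strengths that appear in the recipes: 5 x, 1 x, 0.2 x and 0.1 x SSC.
    for fold in (5, 1, 0.2, 0.1):
        print(f"{fold} x SSC:", ssc_molarity(fold))
```

Under this assumption the 0.1 x SSC wash corresponds to the 0.015 M sodium chloride/0.0015 M sodium citrate figures given in item (1) of the stringent conditions.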

Nucleic acid is "operably linked" when it is placed into a functional relationship
with another nucleic acid sequence. For example, DNA for a presequence or secretory
leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein that
participates in the secretion of the polypeptide; a promoter or enhancer is operably
linked to a coding sequence if it affects the transcription of the sequence; or a ribosome
binding site is operably linked to a coding sequence if it is positioned so as to facilitate
translation. Generally, "operably linked" means that the DNA sequences being linked
are contiguous, and, in the case of a secretory leader, contiguous and in reading phase.
However, enhancers do not have to be contiguous. Linking is accomplished by ligation
at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide
adaptors or linkers are used in accordance with conventional practice.

The terms "amino acid" and "amino acids" refer to all naturally occurring
L-alpha-amino acids. This definition is meant to include norleucine, ornithine, and
homocysteine. The amino acids are identified by either the single-letter or three-letter
designations:

Asp   D   aspartic acid        Ile   I   isoleucine
Thr   T   threonine            Leu   L   leucine
Ser   S   serine               Tyr   Y   tyrosine
Glu   E   glutamic acid        Phe   F   phenylalanine
Pro   P   proline              His   H   histidine
Gly   G   glycine              Lys   K   lysine
Ala   A   alanine              Arg   R   arginine
Cys   C   cysteine             Trp   W   tryptophan
Val   V   valine               Gln   Q   glutamine
Met   M   methionine           Asn   N   asparagine
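
The substitution shorthand used in this disclosure (for example E64A and V110A) combines these single-letter designations with a residue position. A minimal Python sketch of how such shorthand can be expanded into three-letter form is given below; the names ONE_TO_THREE and describe_substitution are hypothetical and are introduced only for illustration.

```python
import re

# Single-letter to three-letter designations, following the table above.
ONE_TO_THREE = {
    "D": "Asp", "T": "Thr", "S": "Ser", "E": "Glu", "P": "Pro",
    "G": "Gly", "A": "Ala", "C": "Cys", "V": "Val", "M": "Met",
    "I": "Ile", "L": "Leu", "Y": "Tyr", "F": "Phe", "H": "His",
    "K": "Lys", "R": "Arg", "W": "Trp", "Q": "Gln", "N": "Asn",
}

# Original residue, position, replacement residue (e.g. "E64A").
_SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def describe_substitution(shorthand: str) -> str:
    """Expand shorthand such as 'E64A' into 'Glu64 -> Ala'."""
    match = _SUBSTITUTION.match(shorthand.strip())
    if match is None:
        raise ValueError(f"not a single-residue substitution: {shorthand!r}")
    original, position, replacement = match.groups()
    for code in (original, replacement):
        if code not in ONE_TO_THREE:
            raise ValueError(f"unknown amino acid code: {code!r}")
    return f"{ONE_TO_THREE[original]}{int(position)} -> {ONE_TO_THREE[replacement]}"


if __name__ == "__main__":
    for name in ("E64A", "V110A"):
        print(name, "=", describe_substitution(name))
```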

In the Sequence Listing and Figures, certain other single-letter or three-letter
designations may be employed to refer to and identify two or more amino acids or
nucleotides at a given position in the sequence.

"Percent (%) amino acid sequence identity" with respect to the ligand or receptor
polypeptide sequences identified herein is defined as the percentage of amino acid
residues in a candidate sequence that are identical with the amino acid residues in such a
ligand or receptor sequence identified herein, after aligning the sequences and
introducing gaps, if necessary, to achieve the maximum percent sequence identity, and
not considering any conservative substitutions as part of the sequence identity.
Alignment for purposes of determining percent amino acid sequence identity can be
achieved in various ways that are within the skill in the art, for instance, using publicly
available computer software such as BLAST, BLAST-2, ALIGN, ALIGN-2 or Megalign
(DNASTAR) software. Those skilled in the art can determine appropriate parameters
for measuring alignment, including any algorithms needed to achieve maximal alignment
over the full-length of the sequences being compared.
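
The definition above can be sketched as a short calculation. The Python example below is illustrative only: it assumes the two sequences have already been aligned by one of the programs named above and are supplied as equal-length strings in which "-" marks a gap, and it takes the number of residues in the candidate sequence as the denominator, which is one reading of the definition. The function name percent_identity and its parameters are hypothetical.

```python
def percent_identity(candidate_aln: str, reference_aln: str, gap: str = "-") -> float:
    """Percent of candidate residues identical to the aligned reference residue.

    Both arguments are rows of a pairwise alignment (same length, gaps as `gap`).
    Conservative substitutions are not counted as identical.
    """
    if len(candidate_aln) != len(reference_aln):
        raise ValueError("aligned sequences must have the same length")
    # Residues in the candidate sequence (gap columns in the candidate are skipped).
    candidate_residues = sum(1 for c in candidate_aln if c != gap)
    if candidate_residues == 0:
        raise ValueError("candidate sequence contains no residues")
    # Columns where the candidate residue is identical to the reference residue.
    identical = sum(
        1 for c, r in zip(candidate_aln, reference_aln) if c != gap and c == r
    )
    return 100.0 * identical / candidate_residues


if __name__ == "__main__":
    # Toy alignment: one mismatch in four candidate residues.
    print(percent_identity("ACKE", "ACDE"))
```

Other conventions divide by the alignment length or by the length of the reference sequence; the choice of denominator should be stated whenever a percent identity value is reported.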

The term "antagonist" is used in the broadest sense, and includes any molecule
that partially or fully blocks, inhibits, or neutralizes one or more biological activities of
heregulin polypeptide, HER3 polypeptide, or both heregulin and HER3, in vitro, in situ, or
in vivo. Examples of such biological activities of heregulin and HER3 polypeptides
include binding of heregulin to HER3 and activation of the autogenic response that is
observed when heregulin interacts with HER3, as reported in the literature (see, e.g. Lee
et al., (2001) Cancer Research 61, 4467-4473; Aguilar et al., (1999) Oncogene 18, 6050-
6052; Marte et al., Oncogene 1995 Jan 5;10(1):167-75; and Neve et al., Oncogene 2000
Mar 23;19(13):1647-56). An antagonist may function in a direct or indirect manner. For
instance, the antagonist may function to partially or fully block, inhibit or neutralize one
or more biological activities of heregulin polypeptide, HER3 polypeptide, or both
heregulin and HER3, in vitro, in situ, or in vivo as a result of their direct interaction. The
antagonist may also function indirectly to partially or fully block, inhibit or neutralize one
or more biological activities of heregulin polypeptide, HER3 polypeptide, or both
heregulin and HER3, in vitro, in situ, or in vivo as a result of, e.g., blocking or inhibiting
another effector molecule. The antagonist molecule may comprise a "dual" antagonist
activity wherein the molecule is capable of partially or fully blocking, inhibiting or
neutralizing a biological activity of both heregulin and HER3.

The term "agonist" is used in the broadest sense, and includes any molecule that
partially or fully enhances, stimulates or activates one or more biological activities of
heregulin polypeptide, HER3 polypeptide, or both heregulin and HER3, in vitro, in situ, or
in vivo. Examples of such biological activities of heregulin and HER3 may include
activation of downstream signal transduction (see, e.g. Aguilar et al., (1999) Oncogene
18, 6050-6052; Marte et al., Oncogene 1995 Jan 5;10(1):167-75; and Neve et al.,
Oncogene 2000 Mar 23;19(13):1647-56). An agonist may function in a direct or indirect
manner. For instance, the agonist may function to partially or fully enhance, stimulate or
activate one or more biological activities of heregulin polypeptide, HER3 polypeptide, or
both heregulin and HER3, in vitro, in situ, or in vivo as a result of its direct binding to
heregulin or HER3, which causes receptor activation or signal transduction. The agonist
may also function indirectly to partially or fully enhance, stimulate or activate one or
more biological activities of heregulin polypeptide, HER3 polypeptide, or both heregulin
and HER3, in vitro, in situ, or in vivo as a result of, e.g., stimulating another effector
molecule which then causes heregulin or HER3 receptor activation or signal
transduction. It is contemplated that an agonist may act as an enhancer molecule which
functions indirectly to enhance or increase heregulin or HER3 activation or activity. For
instance, the agonist may enhance activity of endogenous heregulin or HER3 in a
mammal. This could be accomplished, for example, by pre-complexing heregulin or
HER3 or by stabilizing complexes of the respective ligand with the heregulin or HER3
receptor (such as stabilizing native complex formed between heregulin and HER3).

The term "heregulin antagonist" or "HER3 antagonist" refers to any molecule that
partially or fully blocks, inhibits, or neutralizes a biological activity of heregulin or HER3,
respectively, or both heregulin and HER3, and include, but are not limited to, soluble
forms of HER3 receptor such as an extracellular domain sequence of HER3, HER3
receptor fusion proteins, covalently modified forms of HER3 receptor, HER3 variants,
heregulin variants, HER3 receptor antibodies and heregulin antibodies. To determine
whether a heregulin antagonist molecule partially or fully blocks, inhibits or neutralizes a
biological activity of heregulin or HER3, assays may be conducted to assess the effect(s) of
the antagonist molecule on, for example, binding of heregulin to HER3. Such assays may
be conducted in known in vitro or in vivo assay formats, for instance, in cells expressing
HER3. Preferably, the heregulin antagonist employed in the methods described herein will
be capable of blocking or neutralizing at least one type of heregulin activity, which may
optionally be determined in assays such as described herein. To determine whether an
HER3 antagonist molecule partially or fully blocks, inhibits or neutralizes a biological
activity of heregulin or HER3, assays may be conducted to assess the effect(s) of the
antagonist molecule on, for example, binding of heregulin to HER3. Such assays may be
conducted in known in vitro or in vivo formats, for instance, using cells transfected with
HER3. Preferably, the HER3 antagonist employed in the methods described herein will be
capable of blocking or neutralizing at least one type of HER3 activity. Optionally, a
heregulin antagonist or HER3 antagonist will be capable of reducing or inhibiting binding
of heregulin to HER3 by at least 50%, preferably, by at least 90%, more preferably by at
least 99%, and most preferably, by 100%, as compared to a negative control molecule, in a
binding assay. In one embodiment, the heregulin antagonist or HER3 antagonist will
comprise antibodies which will competitively inhibit the binding of heregulin to HER3.
Methods for determining antibody specificity and affinity by competitive inhibition are
known in the art (see, e.g., Harlow et al., Antibodies: A Laboratory Manual, Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, NY (1998); Colligan et al., Current
Protocols in Immunology, Green Publishing Assoc., NY (1992; 1993); Muller, Meth.
Enzym., 92:589-601 (1983)).
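
The inhibition thresholds above can be illustrated with a short calculation. The Python sketch below is an example under stated assumptions, not a prescribed assay protocol: it assumes the binding assay reports a signal (for instance, counts of labeled heregulin bound) in the presence of the test molecule and in the presence of the negative control molecule, with an optional non-specific background that is subtracted from both. The function names percent_inhibition and inhibition_tier, and the background-subtraction step, are hypothetical.

```python
def percent_inhibition(
    signal_with_test: float,
    signal_with_control: float,
    nonspecific: float = 0.0,
) -> float:
    """Percent reduction in specific binding relative to the negative control."""
    specific_control = signal_with_control - nonspecific
    if specific_control <= 0:
        raise ValueError("negative control shows no specific binding")
    specific_test = signal_with_test - nonspecific
    return 100.0 * (specific_control - specific_test) / specific_control


def inhibition_tier(pct: float) -> str:
    """Map a percent inhibition onto the thresholds recited in the text."""
    tiers = (
        (100.0, "100%"),
        (99.0, "at least 99%"),
        (90.0, "at least 90%"),
        (50.0, "at least 50%"),
    )
    for threshold, label in tiers:
        if pct >= threshold:
            return label
    return "below 50%"


if __name__ == "__main__":
    # Hypothetical readings: 1050 with control, 150 with test molecule, 50 background.
    pct = percent_inhibition(150, 1050, nonspecific=50)
    print(pct, inhibition_tier(pct))
```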

The term "heregulin agonist" or "HER3 agonist" refers to any molecule that
partially or fully enhances, stimulates or activates a biological activity of heregulin or HER3,
respectively, or both heregulin and HER3, and include, but are not limited to HER3 variant
polypeptides and heregulin binding domain specific anti-HER3 receptor antibodies. To
determine whether a heregulin agonist molecule partially or fully enhances, stimulates, or
activates a biological activity of HER3, assays may be conducted to assess the effect(s) of
the agonist molecule on, for example, HER3-expressing or HER3-transfected cells. Such
assays may be conducted in known in vitro or in vivo assay formats. Preferably, the heregulin
agonist employed in the methods described herein will be capable of enhancing or
activating at least one type of heregulin activity, which may optionally be determined in
assays such as described herein.

The term "antibody" is used in the broadest sense and specifically covers, for
example, single monoclonal antibodies against heregulin and HER3 (e.g. those targeting
the heregulin binding domain in HER3), antibody compositions with polyepitopic
specificity, single chain antibodies, and fragments of antibodies. "Antibody" as used
herein includes intact immunoglobulin or antibody molecules, polyclonal antibodies,
multispecific antibodies (i.e., bispecific antibodies formed from at least two intact
antibodies) and immunoglobulin fragments (such as Fab, F(ab')2, or Fv), so long as they
exhibit any of the desired agonistic or antagonistic properties described herein.

Antibodies are typically proteins or polypeptides which exhibit binding specificity
to a specific antigen. Native antibodies are usually heterotetrameric glycoproteins,
composed of two identical light (L) chains and two identical heavy (H) chains. Typically,
each light chain is linked to a heavy chain by one covalent disulfide bond, while the number
of disulfide linkages varies between the heavy chains of different immunoglobulin isotypes.
Each heavy and light chain also has regularly spaced intrachain disulfide bridges. Each
heavy chain has at one end a variable domain (VH) followed by a number of constant
domains. Each light chain has a variable domain at one end (VL) and a constant domain at
its other end; the constant domain of the light chain is aligned with the first constant
domain of the heavy chain, and the light chain variable domain is aligned with the variable
domain of the heavy chain. Particular amino acid residues are believed to form an interface
between the light and heavy chain variable domains (Chothia et al., J. Mol. Biol. 186:651-
663 (1985); Novotny and Haber, Proc. Natl. Acad. Sci. USA, 82:4592-4596 (1985)). The
light chains of antibodies from any vertebrate species can be assigned to one of two clearly
distinct types, called kappa and lambda, based on the amino acid sequences of their
constant domains. Depending on the amino acid sequence of the constant domain of their
heavy chains, immunoglobulins can be assigned to different classes. There are five major
classes of immunoglobulins: IgA, IgD, IgE, IgG and IgM, and several of these may be
further divided into subclasses (isotypes), e.g., IgG-1, IgG-2, IgG-3, and IgG-4; IgA-1 and
IgA-2. The heavy chain constant domains that correspond to the different classes of
immunoglobulins are called alpha, delta, epsilon, gamma, and mu, respectively.

"Antibody fragments" comprise a portion of an intact antibody, generally the
antigen binding or variable region of the intact antibody. Examples of antibody fragments
include Fab, Fab', F(ab')2, and Fv fragments, diabodies, single chain antibody molecules,
and multispecific antibodies formed from antibody fragments.

The term "variable" is used herein to describe certain portions of the variable
domains which differ in sequence among antibodies and are used in the binding and
specificity of each particular antibody for its particular antigen. However, the variability is
not usually evenly distributed through the variable domains of antibodies. It is typically
concentrated in three segments called complementarity determining regions (CDRs) or
hypervariable regions both in the light chain and the heavy chain variable domains. The
more highly conserved portions of the variable domains are called the framework (FR).
The variable domains of native heavy and light chains each comprise four FR regions,
largely adopting a β-sheet configuration, connected by three CDRs, which form loops
connecting, and in some cases forming part of, the β-sheet structure. The CDRs in each
chain are held together in close proximity by the FR regions and, with the CDRs from the
other chain, contribute to the formation of the antigen binding site of antibodies (see
Kabat, E.A. et al., Sequences of Proteins of Immunological Interest, National Institutes of
Health, Bethesda, MD (1987)). The constant domains are not involved directly in binding
an antibody to an antigen, but exhibit various effector functions, such as participation of
the antibody in antibody-dependent cellular toxicity.

The term "monoclonal antibody" as used herein refers to an antibody obtained
from a population of substantially homogeneous antibodies, i.e., the individual antibodies
comprising the population are identical except for possible naturally-occurring mutations
that may be present in minor amounts. Monoclonal antibodies are highly specific, being
directed against a single antigenic site. Furthermore, in contrast to conventional
(polyclonal) antibody preparations which typically include different antibodies directed
against different determinants (epitopes), each monoclonal antibody is directed against a
single determinant on the antigen.

The monoclonal antibodies herein include chimeric, hybrid and recombinant
antibodies produced by splicing a variable (including hypervariable) domain of the antibody
of interest with a constant domain (e.g. "humanized" antibodies), or a light chain with a
heavy chain, or a chain from one species with a chain from another species, or fusions with
heterologous proteins, regardless of species of origin or immunoglobulin class or subclass
designation, as well as antibody fragments (e.g., Fab, F(ab')2, and Fv), so long as they exhibit
the desired biological activity or properties. See, e.g. U.S. Pat. No. 4,816,567 and Mage et
al., in Monoclonal Antibody Production Techniques and Applications, pp.79-97 (Marcel
Dekker, Inc.: New York, 1987).

The term "immunospecific" as used in "immunospecific binding of antibodies"
for example, refers to the antigen specific binding interaction that occurs between the
antigen-combining site of an antibody and the specific antigen recognized by that
antibody.

"Isolated," when used to describe the various proteins disclosed herein, means
protein that has been identified and separated and/or recovered from a component of its
natural environment. Contaminant components of its natural environment are materials
that would typically interfere with diagnostic or therapeutic uses for the protein, and may
include enzymes, hormones, and other proteinaceous or non-proteinaceous solutes. In
preferred embodiments, the protein will be purified (1) to a degree sufficient to obtain at
least 15 residues of N-terminal or internal amino acid sequence by use of a spinning cup
sequenator, or (2) to homogeneity by SDS-PAGE under non-reducing or reducing
conditions using Coomassie blue or, preferably, silver stain. Isolated protein includes
protein in situ within recombinant cells, since at least one component of the protein's natural
environment will not be present. Ordinarily, however, isolated protein will be prepared by
at least one purification step.

"Treatment" or "therapy" refer to both therapeutic treatment and prophylactic or 
20 preventative measures. 

"Mammal" for purposes of treatment or therapy refers to any animal classified as 
a mammal, including humans, domestic and farm animals, and zoo, sports, or pet 
animals, such as dogs, horses, cats, cows, etc. Preferably, the mammal is human.

"HER3-related pathological condition" and "heregulin-related pathological
condition" refer to pathologies or conditions associated with abnormal levels of
expression or activity of HER3 or heregulin, respectively, in excess of, or less than, levels
of expression or activity in normal healthy mammals, where such excess or diminished
levels occur in a systemic, localized, or particular tissue or cell type or location in the
body. HER3-related pathological conditions and heregulin-related pathological
conditions include syndromes characterized by dysregulated cell growth such as cancer.

The terms "cancer", "cancerous", and "malignant" refer to or describe the
physiological condition in mammals that is typically characterized by unregulated cell
growth. Examples of cancer include, but are not limited to, carcinoma including
adenocarcinoma, lymphoma, blastoma, melanoma, sarcoma, and leukemia. More
particular examples of such cancers include squamous cell cancer, small-cell lung cancer,
non-small cell lung cancer, gastrointestinal cancer, Hodgkin's and non-Hodgkin's
lymphoma, pancreatic cancer, glioblastoma, cervical cancer, ovarian cancer, liver cancer
such as hepatic carcinoma and hepatoma, bladder cancer, breast cancer, colon cancer,
colorectal cancer, endometrial carcinoma, myeloma (such as multiple myeloma), salivary
gland carcinoma, kidney cancer such as renal cell carcinoma and Wilms' tumors, basal
cell carcinoma, melanoma, prostate cancer, vulval cancer, thyroid cancer, testicular
cancer, esophageal cancer, and various types of head and neck cancer. Optionally, the
cancer will express, or have associated with the cancer cell, heregulin or HER3. By way
of example, breast and ovarian cancers are among the cancers reported in the literature
to express HER3.

II. Methods and Materials 

The disclosure provided herein teaches that the EGF-like domain of heregulin
(hrg) binds to domains I and II of HER3, in contrast to the EGF receptor, for which
prior studies have shown that a construct consisting of domain III and portions of
domain IV binds EGF. As illustrated in Example 2, a hrg binding site is identified by
limited proteolysis of the recombinant extracellular domains of HER3 (HER3-ECD I-IV)
in both the presence and absence of hrg. In the absence of hrg, HER3-ECD I-IV is
cleaved after position Tyr50, near the beginning of domain I. Binding of hrg to
HER3-ECD I-IV fully protects position Tyr50 from proteolysis. To confirm that domain I
contains a hrg binding site, a HER3 polypeptide comprising only domains I and II (and
not domains III and IV) of HER3 (HER3-ECD I-II) was generated. This HER3-ECD I-II
polypeptide was then shown to bind hrg with 68 nM affinity. These data provide
evidence that domains I and II of HER3-ECD I-IV act as a functional unit in folding and
binding of hrg.
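
To give a sense of scale for the 68 nM figure, the following minimal sketch computes the
fraction of receptor occupied at equilibrium. It assumes a simple 1:1 binding model with free
ligand in excess over receptor; that model, the function name fraction_bound, and the example
concentrations are illustrative assumptions and are not taken from the Examples.

```python
def fraction_bound(ligand_nM: float, kd_nM: float) -> float:
    """Equilibrium receptor occupancy for simple 1:1 binding: [L] / ([L] + Kd).

    Assumes free ligand is approximately equal to total ligand.
    """
    if ligand_nM < 0 or kd_nM <= 0:
        raise ValueError("ligand must be >= 0 and Kd must be > 0")
    return ligand_nM / (ligand_nM + kd_nM)


KD_NM = 68.0  # dissociation constant reported in the text for HER3-ECD I-II and hrg

# Hypothetical hrg concentrations spanning 0.1x, 1x and 10x the Kd.
for conc_nM in (6.8, 68.0, 680.0):
    occupancy = fraction_bound(conc_nM, KD_NM)
    print(f"{conc_nM:7.1f} nM hrg -> {occupancy:.2f} of receptor occupied")
```

Under this model, half of the receptor is occupied when the hrg concentration equals the
dissociation constant.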

The biochemical findings reinforce the structural hypothesis that HER3-ECD I-IV
is similar to the insulin-like growth factor-1 receptor (IGF-1R), as follows: 1) the
protected cleavage site in HER3-ECD I-IV corresponds to a binding footprint in domain I
of IGF-1R; 2) HER3-ECD I-II binds hrg with a 68 nM dissociation constant, supporting
the hypothesis that domain I is involved in ligand binding; and 3) the large accessible
surface area (1749 Å²) of domain L1 of IGF-1R that is buried by domain S1, as well as the
presence of conserved contacts in this interface of type 1 RTKs, provides evidence that
domains L1 and S1 of IGF-1R function as a unit, as observed for HER3-ECD I-II. The
results are consistent with a model wherein HER3 has a structure similar to IGF-1R and
binds ligand at a site in corresponding domains.

In this context, the invention provides methods and materials for identifying
molecules which bind to the heregulin binding site on HER3 as well as methods and
materials for modulating heregulin and/or HER3 activity in mammalian cells. Typical
methods for modulating heregulin and/or HER3 activity in mammalian cells comprise
exposing the cells to a desired amount of antagonist or agonist that affects heregulin
interaction with HER3. Preferably, the amount of antagonist or agonist employed will
be an amount effective to affect the binding and/or activity of the respective ligand or
respective receptor to achieve a therapeutic effect. This can be accomplished in vivo or ex
vivo in accordance, for instance, with the methods described below and in the Examples.
An exemplary condition or disorder to be treated with such antagonists or agonists is
a cancer whose malignant phenotype is associated with aberrant HER3 or heregulin
expression.

Diagnostic methods are also provided herein. For instance, the antagonists or
agonists may be employed to detect the respective binding partner (heregulin and HER3)
in mammals known to have, or suspected of having, a heregulin-related pathological
condition or HER3-related pathological condition. The antagonist or agonist molecule
may be used, e.g., in immunoassays to detect or quantitate heregulin or HER3 in a
sample. A sample, such as cells obtained from a mammal, can be incubated in the
presence of a labeled antagonist or agonist molecule, and detection of the labeled
antagonist or agonist bound in the sample can be performed. Such assays, including
various clinical assay procedures, are known in the art.

The antagonists and agonists which can be employed in the methods include, but
are not limited to, soluble forms of HER3 receptor, HER3 receptor immunoadhesins,
fusion proteins comprising HER3, covalently modified forms of heregulin and HER3,
HER3 receptor variants, anti-HER3 receptor antibodies that target the heregulin binding
domain, and anti-heregulin antibodies that target the HER3 binding domain. Various
techniques that can be employed for making the antagonists and agonists are described
herein.


A. HER3 POLYPEPTIDES 

The disclosure provided herein includes HER3 polypeptides such as the HER3-ECD I-II
polypeptide discussed above as well as HER3 substitution variants such as those
having an amino acid substitution at residue E64 and/or V110. The description below
provides typical schemes for the production of HER3 polypeptides, including HER3
polypeptide variants, by culturing cells transformed or transfected with a vector
containing HER3 polypeptide-encoding nucleic acid. It is, of course, contemplated that
alternative methods, which are well known in the art, may be employed to prepare HER3
polypeptides. For instance, the HER3 polypeptide sequence, or portions thereof, may be
produced by direct peptide synthesis using solid-phase techniques (see, e.g., Stewart et al.,
Solid-Phase Peptide Synthesis, W.H. Freeman Co., San Francisco, CA (1969); Merrifield,
J. Am. Chem. Soc., 85:2149-2154 (1963)). In vitro protein synthesis may be performed
using manual techniques or by automation. Automated synthesis may be accomplished,
for instance, using an Applied Biosystems Peptide Synthesizer (Foster City, CA) according
to the manufacturer's instructions. Various portions of HER3 polypeptides may be chemically
synthesized separately and combined using chemical or enzymatic methods to produce a
full-length HER3 polypeptide.

This disclosure identifies amino acids in discontinuous segments in domain I of
HER3 which appear to be critical for heregulin binding (see, e.g., Table 3). For example,
a mutation at residue 64 (E64A) appears to decrease heregulin binding affinity in both
recombinant HER3 constructs tested, while the V110A mutation causes a
4-fold increase in heregulin binding affinity. Residues 64 and 110 are in domain I of
HER3 and map to residues H30 and L81 in the hormone binding footprint in the
IGF-1R. Residues H30 and L81 lie on different strands in IGF-1R, but are within 12 Å of
each other. HER3 V110 and hrg V180 could form hydrophobic interactions, and HER3
E64 and hrg H178 could form a salt bridge. These specific interactions
could be disrupted by the mutations of E64A and V110A in HER3, as we observed in
our biochemical experiments.
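
A fold-change in affinity can be restated as a change in binding free energy. The sketch
below applies the standard relation ΔΔG = RT ln(Kd,variant / Kd,wild type). The temperature of
298.15 K and the function name ddg_binding are assumptions made for illustration; the text
does not state the conditions under which the 4-fold change was measured.

```python
import math

R_KCAL_PER_MOL_K = 1.987204e-3  # gas constant in kcal/(mol*K)


def ddg_binding(kd_ratio: float, temp_k: float = 298.15) -> float:
    """Change in binding free energy (kcal/mol) for a given Kd ratio.

    kd_ratio is Kd(variant) / Kd(wild type); a negative result means tighter binding.
    The default temperature is an assumption, not a value from the disclosure.
    """
    if kd_ratio <= 0:
        raise ValueError("kd_ratio must be positive")
    return R_KCAL_PER_MOL_K * temp_k * math.log(kd_ratio)


# A 4-fold increase in affinity corresponds to a 4-fold lower Kd, i.e. a ratio of 1/4.
print(f"ddG for 4-fold tighter binding: {ddg_binding(1 / 4):.2f} kcal/mol")
```

Under these assumptions a 4-fold change corresponds to somewhat less than 1 kcal/mol,
which is a modest energetic effect for a single substitution.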

A preferred embodiment of the invention is a HER3 variant polypeptide
comprising an amino acid sequence which differs from the native sequence HER3
polypeptide sequence of SEQ ID NO: 2 and has one or more amino acid
substitutions at the following residue position(s) in SEQ ID NO: 2: E43; N44; K51; E64;
V66; and V110. Typically, the variant binds heregulin with an affinity different from a
HER3 polypeptide that has no amino acid substitution at the enumerated position(s).
Another preferred embodiment of the invention is a HER3 variant polypeptide
comprising an amino acid sequence which differs from the native sequence HER3
polypeptide sequence of SEQ ID NO: 2 and has one or more amino acid
substitutions at the residue positions identified in Table 3. Another embodiment of the
invention is a non-naturally occurring HER3 variant polypeptide consisting essentially of
amino acid 20 to amino acid 329 of HER3 polypeptide sequence of SEQ ID NO: 2,
wherein the HER3 variant polypeptide specifically binds to the heregulin polypeptide of
SEQ ID NO: 4 and exhibits an impaired ability to interact with the HER2 polypeptide of
SEQ ID NO: 6. Related embodiments of this invention include an isolated nucleic acid
comprising a nucleotide sequence encoding these HER3 variants, vectors comprising
nucleic acids encoding these variants and host cells (e.g. E. coli) containing these vectors.
Another related embodiment of the invention is a method of making HER3 variant
polypeptide by providing a host cell comprising a vector encoding a HER3 variant,
providing culture media, culturing the host cell in the culture media under conditions
sufficient to express the HER3 variant polypeptide, recovering the HER3 variant
polypeptide from the host cell or culture media, and then purifying the HER3 variant
polypeptide. Optionally the HER3 variant polypeptide is conjugated or linked to one or
more polyol groups.

The HER3 variants disclosed herein have a number of uses, and can, for
example, be used to elucidate the signalling mechanisms of the heregulin/HER2/HER3
complex, a complex which, as noted above, is implicated in the malignant phenotype of
certain cancers such as breast and ovarian cancers. Specifically, as noted herein, the
molecules of the heregulin/HER2/HER3 complex contain different domains having
different functional properties including an ability to associate in HER3 homo-oligomers
as well as HER2/HER3 hetero-oligomers. The HER2/HER3 hetero-oligomers
associated with heregulin produce a functional signalling complex that is associated with
the biological activity of the molecules. Consequently, the variants disclosed herein
which are designed to target a HER3 domain that binds to heregulin and to, for example,
bind with a greater or lesser affinity to heregulin, can be used to assess interaction
between HER2 and HER3 as well as the interaction between heregulin and HER2 and
HER3. As is known in the art, the elucidation of structural domains associated with
biological activities and the signalling mechanisms of molecules that are associated with
pathologies such as cancer is a crucial step in the identification of novel diagnostic and
therapeutic methods and materials (e.g. heregulin agonists and antagonists) which can be
used to treat such pathologies.

In this regard, the disclosure provides evidence that a specific subset of amino
acid residues in the HER3 polypeptide are likely to be involved in the interaction with
heregulin (and consequently can be mutated in studies designed to characterize the HER2,
HER3 and heregulin interactions). The data providing this evidence include a number
of independent and complementary observations including: (1) sequence comparison
with residues in the structurally homologous IGF-1R molecule that are known to be
associated with ligand binding (see, e.g. Figure 8); (2) crystallographic studies to map a
heregulin binding footprint in HER3 (see, e.g. Figure 9); and (3) the further limitation of
this subset to those residues that are most likely exposed to the solvent environment and
therefore likely to interact with heregulin.

For example, certain residues were chosen for mutation based on their proximity
to the proteolysis site in domain I of HER3 (see, e.g. Singer et al. (2001) J Biol Chem
276(47), 44266-74), conservation to equivalent residues which are critical for ligand
binding in the IR (see, e.g. Williams et al. (1995) J Biol Chem 270(7), 3012-6) and in the
hormone binding footprint in the IGF-1R (see, e.g. Garrett et al. (1998) Nature
394(6691), 395-9) based on a multiple sequence alignment performed between HER3, IR
and IGF-1R. As noted above, this subset of HER3 residues includes those shown in
Tables 2 and 3. The specific results obtained with the examination of members of this
subset of HER3 residues (e.g. V110 and E64) provide further evidence that the
members of this specific subset of amino acid residues in the HER3 polypeptide are
involved in the interaction with heregulin.

The HER3 variants disclosed herein include those where the wild type amino
acid residue at one of the enumerated positions is substituted with any one of the other
19 naturally occurring amino acids. As is known in the art, there is a reasonable
expectation that a substitution at a wild type residue identified as likely to control affinity
(for example residue 110) will affect the affinity of the HER3 variant for heregulin.
Specifically, the pertinent art teaches that in situations where an experimental analysis has
established that the properties of a specific residue at a particular position within the
polypeptide chain are likely involved in maintaining some aspect of a protein's functional
integrity, an alteration in the size, shape, charge, hydrogen-bonding capacity or chemical
reactivity of the amino acid side chain at one of these "active" amino acid positions is
likely to affect the function of the protein in some way. For this reason, the skilled
artisan would reasonably expect a perturbation in a HER3 residue likely to control the
affinity of HER3 for heregulin to affect the affinity of HER3 for heregulin.
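
The set of single-residue variants described above can be enumerated mechanically. The
following sketch lists every substitution of the six enumerated positions (E43, N44, K51, E64,
V66 and V110 of SEQ ID NO: 2) with each of the other 19 standard amino acids. The naming
convention (wild-type residue, position, new residue) and the helper name
single_substitutions are illustrative choices, not part of the disclosure.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids

# (wild-type residue, position in SEQ ID NO: 2) for the positions enumerated in the text.
POSITIONS = [("E", 43), ("N", 44), ("K", 51), ("E", 64), ("V", 66), ("V", 110)]


def single_substitutions(wild_type: str, position: int) -> list[str]:
    """Names of the 19 variants replacing wild_type at position, e.g. 'E64A'."""
    if wild_type not in AMINO_ACIDS:
        raise ValueError(f"unknown residue: {wild_type!r}")
    return [f"{wild_type}{position}{aa}" for aa in AMINO_ACIDS if aa != wild_type]


variants = [name for wt, pos in POSITIONS for name in single_substitutions(wt, pos)]
print(len(variants), "single-substitution variants")
print(variants[:3])
```

The list contains only single substitutions; variants carrying substitutions at more than
one position would need to be built as combinations of these.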

The invention further provides recombinant DNA or RNA molecules containing a
HER3 polynucleotide, or a fragment or variant thereof, including but not limited to phages,
plasmids, phagemids, cosmids, YACs, BACs, as well as various viral and non-viral vectors
well known in the art, and cells transformed or transfected with such recombinant DNA or
RNA molecules. Methods for generating such molecules are well known (see, for example,
Sambrook et al., 1989, supra). The invention further provides a host-vector system
comprising a recombinant DNA molecule containing a HER3 polynucleotide, or a fragment
or variant thereof, within a suitable prokaryotic or eukaryotic host cell. Examples of suitable
eukaryotic host cells include a yeast cell, a plant cell, or an animal cell, such as a
mammalian cell or an insect cell (e.g., a baculovirus-infectible cell such as an Sf9 or
HighFive cell). Examples of suitable mammalian cells include a number of mammalian
cells routinely used for the expression of recombinant proteins (e.g., COS, CHO, 293,
293T cells). More particularly, a polynucleotide comprising the coding sequence of
HER3, or a fragment or variant thereof, can be used to generate HER3 proteins or fragments
thereof using any number of host-vector systems routinely used and widely known in the
art.

A wide range of host-vector systems suitable for the expression of HER3 proteins
or fragments thereof are available (see, for example, Sambrook et al., 1989, supra; Current
Protocols in Molecular Biology, 1995, supra). Preferred vectors for mammalian
expression include but are not limited to pcDNA 3.1 myc-His-tag (Invitrogen) and the
retroviral vector pSRαtkneo (Muller et al., 1991, MCB 11:1785). The host-vector
systems of the invention are useful for the production of a HER3 protein or fragment
thereof. Such host-vector systems can be employed to study the functional properties of
HER3 and HER3 mutations or analogs.

Recombinant human HER3 protein can be produced by mammalian cells
transfected with a construct encoding a HER3 nucleotide. For example, 293T cells can
be transfected with an expression plasmid encoding HER3 or a fragment or variant thereof,
the HER3 or related protein is expressed in the 293T cells, and the recombinant HER3
protein is isolated using standard purification methods (e.g., affinity purification using
anti-HER3 antibodies). In another embodiment, a HER3 coding sequence is subcloned
into the retroviral vector pSRαMSVtkneo and used to infect various mammalian cell
lines, such as NIH 3T3, COS, CHO and MCF-7, in order to establish HER3-expressing
cell lines. Various other expression systems well known in the art can also be employed.
Expression constructs encoding a leader peptide joined in frame to the HER3 coding
sequence can be used for the generation of a secreted form of recombinant HER3
protein.

As discussed herein, redundancy in the genetic code permits variation in HER3
gene sequences. In particular, it is known in the art that specific host species often have
specific codon preferences, and thus one can adapt the disclosed sequence as preferred
for a desired host. For example, preferred analog codon sequences typically have rare
codons (i.e., codons having a usage frequency of less than about 20% in known
sequences of the desired host) replaced with higher frequency codons.
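
The rare-codon replacement described above can be sketched as follows. The usage table
is HYPOTHETICAL and covers only three amino acids; real values would have to be taken from a
codon usage table for the intended host. The sketch also assumes that "usage frequency" means
the fraction of a codon among its synonymous codons, which is one possible reading of the
text. All names (USAGE, replace_rare_codons) are illustrative.

```python
# HYPOTHETICAL relative codon usage (fraction among synonymous codons) for an
# imaginary host. These numbers are placeholders, not measured data.
USAGE = {
    "R": {"CGT": 0.40, "CGC": 0.38, "CGG": 0.10, "CGA": 0.06, "AGA": 0.04, "AGG": 0.02},
    "L": {"CTG": 0.50, "TTA": 0.13, "TTG": 0.13, "CTT": 0.10, "CTC": 0.10, "CTA": 0.04},
    "M": {"ATG": 1.00},
}
CODON_TO_AA = {codon: aa for aa, table in USAGE.items() for codon in table}
RARE_THRESHOLD = 0.20  # "less than about 20%" per the text


def replace_rare_codons(cds: str) -> str:
    """Swap each rare codon for the most frequent synonymous codon in USAGE.

    Codons absent from the toy table are left unchanged.
    """
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    out = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3].upper()
        aa = CODON_TO_AA.get(codon)
        if aa is not None and USAGE[aa][codon] < RARE_THRESHOLD:
            codon = max(USAGE[aa], key=USAGE[aa].get)  # highest-usage synonym
        out.append(codon)
    return "".join(out)


# Met-Arg-Leu-Arg written with two rare codons (AGG, CTA) under the toy table.
print(replace_rare_codons("ATGAGGCTACGT"))
```

Because only synonymous codons are exchanged, the encoded amino acid sequence is
unchanged by this step.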

Additional sequence modifications are known to enhance protein expression in a
cellular host. These include elimination of sequences encoding spurious polyadenylation
signals, exon/intron splice site signals, transposon-like repeats, and/or other such
well-characterized sequences that are deleterious to gene expression. The GC content of the
sequence is adjusted to levels average for a given cellular host, as calculated by reference
to known genes expressed in the host cell. Where possible, the sequence is modified to
avoid predicted hairpin secondary mRNA structures. Other useful modifications include
the addition of a translational initiation consensus sequence at the start of the open
reading frame, as described in Kozak, Mol. Cell Biol., 9:5073-5080 (1989).
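
Two of the checks mentioned above lend themselves to a short sketch: computing GC
content and locating candidate polyadenylation signals. The motif AATAAA is the canonical
polyadenylation hexamer; treating every occurrence as "spurious" is a simplification, and
the example sequence and function names are hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def find_motif(seq: str, motif: str = "AATAAA") -> list[int]:
    """0-based start positions of every occurrence of motif, overlaps included."""
    seq, motif = seq.upper(), motif.upper()
    hits = []
    start = seq.find(motif)
    while start != -1:
        hits.append(start)
        start = seq.find(motif, start + 1)
    return hits


example = "ATGGCCAATAAAGGCTGA"  # hypothetical sequence, not derived from HER3
print(f"GC content: {gc_content(example):.2f}")
print("AATAAA found at positions:", find_motif(example))
```

A sequence flagged in this way would then be altered with synonymous codons so that the
motif is removed without changing the encoded protein.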

Embodiments of the invention disclosed herein include a wide variety of
art-accepted variants of HER3 proteins such as polypeptides having amino acid insertions,
deletions and substitutions. HER3 variants can be made using methods known in the art
such as site-directed mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed
mutagenesis (Carter et al., Nucl. Acids Res., 13:4331 (1986); Zoller et al., Nucl. Acids Res.,
10:6487 (1987)), cassette mutagenesis (Wells et al., Gene, 34:315 (1985)), restriction
selection mutagenesis (Wells et al., Philos. Trans. R. Soc. London SerA, 317:415 (1986)) or
other known techniques can be performed on the cloned DNA to produce the HER3
variant DNA.

Scanning amino acid analysis can also be employed to identify one or more
amino acids along a contiguous sequence that is involved in a specific biological activity
such as a protein-protein interaction. Among the preferred scanning amino acids are
relatively small, neutral amino acids. Such amino acids include alanine, glycine, serine,
and cysteine. Alanine is typically a preferred scanning amino acid among this group
because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the
main-chain conformation of the variant. Alanine is also typically preferred because it is
the most common amino acid. Further, it is frequently found in both buried and
exposed positions (Creighton, The Proteins, (W.H. Freeman & Co., N.Y.); Chothia, J. Mol.
Biol., 150:1 (1976)). If alanine substitution does not yield adequate amounts of variant,
an isosteric amino acid can be used.
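
As an illustration of scanning along a contiguous sequence, the sketch below generates
the alanine-substituted sequence for each residue in a chosen window. The peptide used is a
hypothetical six-residue string, not a HER3 sequence, and the function name alanine_scan and
its 1-based, inclusive numbering are assumptions made for the example.

```python
def alanine_scan(sequence: str, start: int, end: int) -> dict[str, str]:
    """Map variant name to mutated sequence for residues start..end (1-based, inclusive).

    Positions that are already alanine are skipped, since substituting them
    would not change the sequence.
    """
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("window must lie within the sequence")
    variants = {}
    for pos in range(start, end + 1):
        wild_type = sequence[pos - 1]
        if wild_type == "A":
            continue
        variants[f"{wild_type}{pos}A"] = sequence[:pos - 1] + "A" + sequence[pos:]
    return variants


toy_peptide = "MKEAVL"  # hypothetical peptide used only for illustration
scan = alanine_scan(toy_peptide, 2, 5)
for name, mutated in scan.items():
    print(name, mutated)
```

Each resulting sequence differs from the parent at exactly one position, so any change in
a measured activity can be attributed to that residue.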

HER3 proteins can be generated using standard peptide synthesis technology or
using chemical cleavage methods well known in the art. Alternatively, recombinant
methods can be used to generate nucleic acid molecules that encode a HER3 protein. In
one embodiment, nucleic acid molecules provide a means to generate defined fragments of
the HER3 protein (or variants thereof).

Modifications of HER3 proteins such as covalent modifications are included
within the scope of this invention. One type of covalent modification includes reacting
targeted amino acid residues of a HER3 polypeptide with an organic derivatizing agent
that is capable of reacting with selected side chains or the N- or C-terminal residues of
the HER3. Another type of covalent modification of the HER3 polypeptide included
within the scope of this invention comprises altering the native glycosylation pattern of a
protein of the invention. Another type of covalent modification of HER3 comprises
linking the HER3 polypeptide to one of a variety of nonproteinaceous polymers, e.g.,
polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in the manner set
forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or
4,179,337.

B. ANTIBODIES 

It is contemplated that anti-HER3 receptor antibodies that specifically target the
heregulin binding domain in HER3 may also be employed in the presently disclosed
methods. Examples of such molecules include neutralizing or blocking antibodies which
can preferably inhibit binding of heregulin to HER3 receptors. The anti-HER3
antibodies may be monoclonal antibodies.

Monoclonal antibodies may be prepared using hybridoma methods, such as those
described by Kohler and Milstein, Nature, 256:495 (1975). In a hybridoma method, a
mouse, hamster, or other appropriate host animal is typically immunized with an
immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies
that will specifically bind to the immunizing agent. Alternatively, the lymphocytes may be
immunized in vitro.

The immunizing agent will typically include a HER3 polypeptide or a fusion
protein thereof, such as a HER3 ECD-IgG fusion protein. The immunizing agent may
alternatively comprise a fragment or portion of HER3 having one or more amino acids that
participate in the binding of HER3 to heregulin. In a preferred embodiment, the
immunizing agent comprises an extracellular domain sequence of HER3 fused to an IgG
sequence.

Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian
sources are desired. The lymphocytes are then fused with an immortalized cell line using a
suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding,
Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) pp. 59-103).
Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells
of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are
employed. The hybridoma cells may be cultured in a suitable culture medium that
preferably contains one or more substances that inhibit the growth or survival of the
unfused, immortalized cells. For example, if the parental cells lack the enzyme
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT
medium"), which substances prevent the growth of HGPRT-deficient cells.

C. METHODS OF SCREENING MOLECULES THAT BIND TO THE
HEREGULIN BINDING SITE IN HER3

The invention also encompasses methods of screening molecules to identify
those which can act as agonists or antagonists of the heregulin/HER3 interaction. Such
molecules may comprise small molecules or polypeptides, including antibodies.
Examples of small molecules include, but are not limited to, small peptides or
peptide-like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or
inorganic compounds. The screening assays for drug candidates are designed to identify
compounds or molecules that bind or complex with the ligand or receptor polypeptides
identified herein, or otherwise interfere with the interaction of these polypeptides with
other cellular proteins. Such screening assays will include assays amenable to
high-throughput screening of chemical libraries, making them particularly suitable for
identifying small molecule drug candidates.

The assays can be performed in a variety of formats, including protein-protein
binding assays, biochemical screening assays, immunoassays, and cell-based assays, which
are well characterized in the art. Assays for antagonists, for instance, are common in that
they call for contacting the drug candidate with a ligand or receptor polypeptide
identified herein under conditions and for a time sufficient to allow these two
components to interact.

In binding assays, the interaction is binding and the complex formed can be
isolated or detected in the reaction mixture. In a particular embodiment, a preferred
receptor polypeptide identified herein (e.g. HER3 polypeptide having only domains I and
II) or the drug candidate is immobilized on a solid phase, e.g., on a microtiter plate, by
covalent or non-covalent attachments. Non-covalent attachment generally is
accomplished by coating the solid surface with a solution of the ligand or receptor
polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal
antibody, specific for the ligand or receptor polypeptide to be immobilized can be used
to anchor it to a solid surface. The assay is performed by adding the non-immobilized
component, which may be labeled by a detectable label, to the immobilized component,
e.g., the coated surface containing the anchored component. When the reaction is
complete, the non-reacted components are removed, e.g., by washing, and complexes
anchored on the solid surface are detected. When the originally non-immobilized
component carries a detectable label, the detection of label immobilized on the surface
indicates that complexing occurred. Where the originally non-immobilized component
does not carry a label, complexing can be detected, for example, by using a labeled
antibody specifically binding the immobilized complex.

If the candidate compound interacts with but does not bind to a particular ligand
or receptor polypeptide identified herein, its interaction with that polypeptide can be
assayed by methods well known for detecting protein-protein interactions. Such assays
include traditional approaches, such as, e.g., cross-linking, co-immunoprecipitation, and
co-purification through gradients or chromatographic columns. In addition, protein- 
protein interactions can be monitored by using a yeast-based genetic system described by 
Fields and co-workers (Fields and Song, Nature (London) . 340:245-246 (1989); Chien et 
5 aL, Proc. Natl. Acad. Sci. USA . 88:9578-9582 (1991)) as disclosed by Chevray and 
Nathans, Proc. Nad. Acad Sci. USA , 89: 5789-5793 (1991). Many transcriptional 
activators, such as yeast GAL4, consist of two physically discrete modular domains, one 
acting as the DNA-binding domain, the other one functioning as the transcription- 
activation domain. The yeast expression system described in the foregoing publications 

10 (generally referred to as the "two-hybrid system") takes advantage of this property, and 
employs two hybrid proteins, one in which the target protein is fused to the DNA- 
binding domain of GAL4, and another, in which candidate activating proteins are fused 
to the activation domain. The expression of a GALl-lacZ reporter gene under control of 
a GAL4-activated promoter depends on reconstitution of GAL4 activity via protein- 

15 protein interaction. Colonies containing interacting polypeptides are detected with a 
chromogenic substrate for P-galactosidase. A complete kit (MATCH3\1AKER™) for 
identifying protein-protein interactions between two specific proteins using the two- 
hybrid technique is commercially available from Clontech. This system can also be 
extended to map protein domains involved in specific protein interactions as well as to
pinpoint amino acid residues that are crucial for these interactions.

Compounds or molecules that interfere with the interaction of a ligand or 
receptor polypeptide identified herein and other intra- or extracellular components can 
be tested as follows: usually a reaction mixture is prepared containing the product of the 
gene and the intra- or extracellular component under conditions and for a time allowing
for the interaction and binding of the two products. To test the ability of a candidate
compound to inhibit binding, the reaction is run in the absence and in the presence of 
the test compound. In addition, a placebo may be added to a third reaction mixture, to 
serve as positive control. The binding (complex formation) between the test compound
and the intra- or extracellular component present in the mixture is monitored as
described hereinabove. The formation of a complex in the control reaction(s) but not in 
the reaction mixture containing the test compound indicates that the test compound 
interferes with the interaction of the test compound and its reaction partner. 
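
The comparison described above, in which the reaction is run in the absence and in the presence of the test compound, can be expressed numerically. The following is a minimal sketch in Python and is not part of the original disclosure; the function names, the background correction, and the 50% cutoff are illustrative assumptions.

```python
def percent_inhibition(signal_without: float, signal_with: float,
                       background: float = 0.0) -> float:
    """Percent reduction in complex formation caused by a test compound.

    signal_without: binding signal of the control reaction (no test compound)
    signal_with:    binding signal of the reaction containing the test compound
    background:     hypothetical signal measured with no binding partner present
    """
    window = signal_without - background
    if window <= 0:
        raise ValueError("control signal must exceed background")
    return 100.0 * (signal_without - signal_with) / window


def interferes(signal_without: float, signal_with: float,
               background: float = 0.0, cutoff_percent: float = 50.0) -> bool:
    """Flag a compound whose inhibition meets an assumed, arbitrary cutoff."""
    inhibition = percent_inhibition(signal_without, signal_with, background)
    return inhibition >= cutoff_percent


if __name__ == "__main__":
    # Made-up example readings, for illustration only.
    print(percent_inhibition(1000.0, 250.0))
    print(interferes(1000.0, 250.0))
```

The cutoff is a screening convenience only; an appropriate value would depend on the assay and its variability.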

To assay for antagonists, the ligand or receptor polypeptide may be added to a
cell along with the compound to be screened for a particular activity and the ability of
the compound to inhibit the activity of interest in the presence of the ligand or receptor
polypeptide indicates that the compound is an antagonist to the ligand or receptor 
polypeptide. Alternatively, antagonists may be detected by combining the ligand or
receptor polypeptide and a potential antagonist with membrane-bound polypeptide
receptors or recombinant receptors under appropriate conditions for a competitive 
inhibition assay. The ligand or receptor polypeptide can be labeled, such as by 
radioactivity, such that the number of polypeptide molecules bound to the receptor can 
be used to determine the effectiveness of the potential antagonist. The gene encoding
the receptor can be identified by numerous methods known to those of skill in the art,
for example, ligand panning and FACS sorting. Coligan et al., Current Protocols in
Immun., 1(2): Chapter 5 (1991). Preferably, expression cloning is employed wherein
polyadenylated RNA is prepared from a cell responsive to the ligand or receptor 
polypeptide and a cDNA library created from this RNA is divided into pools and used to
transfect COS cells or other cells that are not responsive to the ligand or receptor
polypeptide. Transfected cells that are grown on glass slides are exposed to labeled 
ligand or receptor polypeptide. The ligand or receptor polypeptide can be labeled by a 
variety of means including iodination or inclusion of a recognition site for a site-specific 
protein kinase. Following fixation and incubation, the slides are subjected to
autoradiographic analysis. Positive pools are identified and sub-pools are prepared and
re-transfected using an interactive sub-pooling and re-screening process, eventually
yielding a single clone that encodes the putative receptor.

As an alternative approach, labeled ligand polypeptide can be photoaffinity-
linked with cell membrane or extract preparations that express the receptor molecule. Cross-
linked material is resolved by PAGE and exposed to X-ray film. The labeled complex
containing the receptor can be excised, resolved into peptide fragments, and subjected to
protein micro-sequencing. The amino acid sequence obtained from micro-sequencing
would be used to design a set of degenerate oligonucleotide probes to screen a cDNA 
library to identify the gene encoding the putative receptor. 

A representative embodiment of this invention comprises a method of screening 
for a molecule that interacts with the heregulin binding domain in HER3 ("HER3
heregulin binding domain") comprising the steps of contacting a population of molecules
with a polypeptide comprising the HER3 heregulin binding domain, allowing the 
population of molecules and the HER3 heregulin binding domain to interact under 
conditions that facilitate an interaction, determining the presence of a molecule that 
interacts with the HER3 heregulin binding domain, and then separating molecules that
do not interact with the HER3 heregulin binding domain from molecules that do. In a
specific embodiment, the method further comprises purifying a molecule that interacts 
with the HER3 heregulin binding domain. The identified molecule can be used to 
modulate a function performed by HER3. This embodiment of the invention is well 
suited to screen chemical libraries for molecules which modulate, e.g., inhibit, antagonize,
agonize, or mimic, the activity of HER3 as measured by one of the assays disclosed
herein or known in the art (see, e.g., Aguilar et al. (1999) Oncogene 18, 6050-6062). The
chemical libraries can be peptide libraries, peptidomimetic libraries, chemically
synthesized libraries, recombinant libraries, e.g., phage display libraries, in vitro translation-
based libraries, and other non-peptide synthetic organic libraries.

Exemplary libraries are commercially available from several sources (ArQule,
Tripos/PanLabs, ChemDesign, Pharmacopoeia). In some cases, these chemical libraries
are generated using combinatorial strategies that encode the identity of each member of
the library on a substrate to which the member compound is attached, thus allowing
direct and immediate identification of a molecule that is an effective modulator. Thus, in
many combinatorial approaches, the position on a plate of a compound specifies that 
compound's composition. Also, in one example, a single plate position may have from
1-20 chemicals that can be screened by administration to a well containing the
interactions of interest. Thus, if modulation is detected, smaller and smaller pools of
interacting pairs can be assayed for the modulation activity. By such methods, many 
candidate molecules can be screened. 
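
The strategy of assaying smaller and smaller pools once modulation is detected can be sketched as a recursive split. This is an illustrative sketch only, not a procedure taken from the disclosure: the function name, the halving scheme, and the compound labels are hypothetical, and the sketch assumes that a pool scores positive whenever it contains at least one active compound (no masking or synergy between pool members).

```python
from typing import Callable, List, Sequence


def deconvolute(pool: Sequence[str],
                is_active: Callable[[Sequence[str]], bool]) -> List[str]:
    """Return the active members of a pool by repeatedly halving positive pools.

    is_active stands in for the assay: it receives a pool of compounds and
    reports whether modulation is detected for that pool.
    """
    if not pool or not is_active(pool):
        return []          # negative pools are discarded without further testing
    if len(pool) == 1:
        return [pool[0]]   # a positive pool of one identifies the compound
    mid = len(pool) // 2
    return (deconvolute(pool[:mid], is_active)
            + deconvolute(pool[mid:], is_active))


if __name__ == "__main__":
    # Hypothetical plate position holding 20 compounds, two of them active.
    compounds = [f"cpd{i}" for i in range(1, 21)]
    actives = {"cpd7", "cpd15"}

    def mock_assay(p: Sequence[str]) -> bool:
        return any(c in actives for c in p)

    print(deconvolute(compounds, mock_assay))
```

Halving keeps the number of assays roughly logarithmic in the pool size for each active compound, which is one reason pooled screening can cover many candidates with few wells.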

Many diversity libraries suitable for use are known in the art and can be used to 
provide compounds to be tested according to the present invention. Alternatively,
libraries can be constructed using standard methods. Chemical (synthetic) libraries,
recombinant expression libraries, or polysome-based libraries are exemplary types of 
libraries that can be used. 

In one embodiment, one can screen peptide libraries to identify molecules that 
interact with HER3 heregulin binding domain protein sequences. In such methods,
peptides that bind to the HER3 heregulin binding domain are identified by screening
libraries that encode a random or controlled collection of amino acids. Peptides encoded
by the libraries are expressed as fusion proteins of bacteriophage coat proteins; the
bacteriophage particles are then screened against the protein of interest.

Accordingly, peptides having a wide variety of uses, such as therapeutic,
prognostic or diagnostic reagents, are thus identified without any prior information on
the structure of the expected ligand or receptor molecule. Typical peptide libraries and 
screening methods that can be used to identify molecules that interact with HER3 
heregulin binding domain sequences are disclosed for example in U.S. Patent Nos. 
5,723,286 issued 3 March 1998 and 5,733,731 issued 31 March 1998. 

Small molecules and ligands that interact with HER3 heregulin binding domain
can be identified through related embodiments of such screening assays. For example,
small molecules can be identified that interfere with protein function, including 
molecules that interfere with the tyrosine kinase activity associated with HER3.

A typical embodiment is a method of identifying a compound which specifically
binds the HER3 heregulin binding domain, comprising the steps of contacting the HER3 
heregulin binding domain with a test compound under conditions favorable to binding; 
and then determining whether said test compound binds to said HER3 heregulin binding
domain so that a compound which binds to said domain can be identified. As the
interactions between various receptor tyrosine kinases and a variety of test compounds
have been previously described, skilled artisans are familiar with the conditions
conducive to binding. A specific embodiment of this aspect of the invention includes
the steps of transfecting cells with a construct encoding HER3, contacting said cells with
said test compound that is tagged or labelled with a detectable marker and then analyzing
said cells for the presence of bound test compound. In contexts where the transfected cells
are observed to preferentially bind the test compound as compared to cells that have not
been transfected with a HER3 construct, this indicates that the test compound is
binding to the HER3 protein expressed by those cells.

A test compound which binds said HER3 heregulin binding domain may then be
further screened for the inhibition of a biological activity (e.g. tyrosine kinase activity)
associated with HER3. Such an embodiment includes, for example, determining whether
said test compound inhibits the tyrosine kinase activity associated with HER3 by utilizing
molecular biological protocols to create recombinant constructs whose enzymological and
biological properties can be examined directly. Enzymology is performed, for example,
by measuring tyrosine kinase activity in vitro or in HER3 expressing cells using standard
assays.

D. ASSAY METHODS

Diagnostic methods are also provided herein. For instance, the antagonists or
agonists may be employed to detect HER3 and/or heregulin in mammals known to be
or suspected of having a HER3 and/or heregulin related pathological condition. The
antagonist or agonist molecule may be used, e.g., in immunoassays to detect or quantitate
heregulin and/or HER3 in a sample. A sample, such as cells obtained from a mammal,
can be incubated in the presence of a labeled antagonist or agonist molecule, and
detection of the labeled antagonist or agonist bound in the sample can be performed.
Such assays, including various clinical assay procedures, are known in the art, for instance
as described in Voller et al., Immunoassays, University Park, 1981.

Ligand/receptor binding studies may be carried out in any known assay method, 
such as competitive binding assays, direct and indirect sandwich assays, and 
immunoprecipitation assays. Cell-based assays and animal models can be used as
diagnostic methods and to further understand the interaction between the ligands and
receptors identified herein and the development and pathogenesis of the conditions and
diseases referred to herein. 

In one approach, mammalian cells may be transfected with the ligands or 
receptors described herein, and the ability of the agonists or antagonists to stimulate or 
inhibit binding or activity is analyzed. Suitable cells can be transfected with the desired
gene, and monitored for activity. Such transfected cell lines can then be used to test the
ability of antagonist(s) or agonist(s) to inhibit or stimulate, for example, to modulate B-
cell proliferation or Ig secretion. Cells transfected with the coding sequence of the genes
identified herein can further be used to identify drug candidates for the treatment of 
immune related diseases or cancer. 

In addition, primary cultures derived from transgenic animals can be used in the
cell-based assays. Techniques to derive continuous cell lines from transgenic animals are
well known in the art (see, e.g., Small et al., Mol. Cell. Biol. 5:642-648 (1985)).

E. FORMULATIONS 

The HER3 antagonists or agonists described herein are optionally employed in
a carrier. Suitable carriers and their formulations are described in Remington's
Pharmaceutical Sciences, 16th ed., 1980, Mack Publishing Co., edited by Osol et al.
Typically, an appropriate amount of a pharmaceutically-acceptable salt is used in the carrier
to render the formulation isotonic. Examples of the carrier include saline, Ringer's solution
and dextrose solution. The pH of the carrier is preferably from about 5 to about 8, and
more preferably from about 7.4 to about 7.8. It will be apparent to those persons skilled in
the art that certain carriers may be more preferable depending upon, for instance, the route
of administration and concentration of active agent being administered. The carrier may be
in the form of a lyophilized formulation or aqueous solution.

Acceptable carriers, excipients, or stabilizers are preferably nontoxic to cells 
and/or recipients at the dosages and concentrations employed, and include buffers such
as phosphate, citrate, and other organic acids; antioxidants including ascorbic acid and
methionine; preservatives (such as octadecyldimethylbenzyl ammonium chloride;
hexamethonium chloride; benzalkonium chloride, benzethonium chloride; phenol, butyl 
or benzyl alcohol; alkyl parabens such as methyl or propyl paraben; catechol; resorcinol; 
cyclohexanol; 3-pentanol; and m-cresol); low molecular weight (less than about 10 
residues) polypeptides; proteins, such as serum albumin, gelatin, or immunoglobulins;
hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine,
glutamine, asparagine, histidine, arginine, or lysine; monosaccharides, disaccharides, and 
other carbohydrates including glucose, mannose, or dextrins; chelating agents such as 
EDTA; sugars such as sucrose, mannitol, trehalose or sorbitol; salt-forming counter-ions
such as sodium; and/or non-ionic surfactants such as TWEEN™, PLURONICS™ or
polyethylene glycol (PEG).

The formulation may also contain more than one active compound as necessary 
for the particular indication being treated, preferably those with complementary activities 
that do not adversely affect each other. 

The HER3 antagonist or agonist described herein may also be entrapped in
microcapsules prepared, for example, by coacervation techniques or by interfacial
polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and poly-
(methylmethacrylate) microcapsules, respectively, in colloidal drug delivery systems (for
example, liposomes, albumin microspheres, microemulsions, nano-particles and
nanocapsules) or in macroemulsions. Such techniques are disclosed in Remington's
Pharmaceutical Sciences, 16th edition, Osol, A. Ed. (1980).

The formulations to be used for in vivo administration should be sterile. This is 
readily accomplished by filtration through sterile filtration membranes. Sustained-release
preparations may be prepared. Suitable examples of sustained-release preparations
include semipermeable matrices of solid hydrophobic polymers containing the active 
agent, which matrices are in the form of shaped articles, e.g. films, or microcapsules. 
Examples of sustained-release matrices include polyesters, hydrogels (for example, 
poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides (U.S. Pat. No.
3,773,919), copolymers of L-glutamic acid and γ-ethyl-L-glutamate, non-degradable
ethylene-vinyl acetate, degradable lactic acid-glycolic acid copolymers such as the 
LUPRON DEPOT™ (injectable microspheres composed of lactic acid-glycolic acid 
copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid. While 
polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release of
molecules for over 100 days, certain hydrogels release proteins for shorter time periods.

Related embodiments of the invention include methods for the preparation of a 
medication for the treatment of pathological conditions including cancer by preparing a 
HER3 variant polypeptide (e.g. a variant having a substitution at V110) composition for
administration to a mammal having the pathological condition. A related method is the
use of an effective amount of a HER3 variant polypeptide in the preparation of a
medicament for the treatment of cancer. Such methods typically involve the steps of
including an amount of HER3 variant polypeptide sufficient to inhibit the interaction of 
heregulin and HER3 in vivo and an appropriate amount of a physiologically acceptable 
carrier. As is known in the art, optionally other agents can be included in these
preparations.

F. MODES OF THERAPY

The invention further provides methods for modulating HER3 and/or heregulin
activity in mammalian cells which comprise exposing the cells to a desired amount of
antagonist or agonist that affects heregulin interaction with HER3. Preferably, the amount
of antagonist or agonist employed will be an amount effective to affect the binding and/or
activity of the respective ligand or respective receptor to achieve a therapeutic effect. This
can be accomplished in vivo or ex vivo in accordance, for instance, with the methods
described below. Exemplary conditions or disorders to be treated with HER3 agonists or
heregulin agonists include cancer such as those associated with an aberrant expression of
HER3 and/or heregulin.

The molecules described herein are useful in treating various pathological 
conditions, such as cancer. These conditions can be treated by stimulating or inhibiting a 
selected activity associated with heregulin or HER3 in a mammal through, for example, 
administration of one or more antagonists or agonists described herein. 

Diagnosis in mammals of the various pathological conditions described herein
can be made by the skilled practitioner. Diagnostic techniques are available in the art
which allow, e.g., for the diagnosis or detection of cancer or immune related disease in a
mammal. For instance, cancers may be identified through techniques including, but not
limited to, palpation, blood analysis, x-ray, NMR and the like.

The antagonist(s) or agonist(s) can be administered in accord with known
methods, such as intravenous administration as a bolus or by continuous infusion over a
period of time, by intramuscular, intraperitoneal, intracerebrospinal, subcutaneous, intra- 
articular, intrasynovial, intrathecal, oral, topical, or inhalation routes. Optionally, 
administration may be performed through mini-pump infusion using various
commercially available devices. The antagonists or agonists may also be employed using
gene therapy techniques which have been described in the art. Effective dosages and 
schedules for administering antagonists or agonists may be determined empirically, and 
making such determinations is within the skill in the art. Single or multiple dosages may be
employed. It is presently believed that an effective dosage or amount of antagonist or
agonist used alone may range from about 1 ng/kg to about 100 mg/kg of body weight or
more per day. Interspecies scaling of dosages can be performed in a manner known in the
art, e.g., as disclosed in Mordenti et al., Pharmaceut. Res., 8:1351 (1991).

As noted above, the HER3 polypeptides, HER3 polypeptide variants, HER3
polypeptide fragments, HER3 polynucleotides encoding said polypeptides, variants and
fragments, and the HER3 agents useful in the methods of the invention can be
incorporated into pharmaceutical compositions suitable for administration into a
mammal. Such compositions typically comprise at least one HER3 polypeptide, HER3
polypeptide variant, HER3 polypeptide fragment, HER3 polynucleotide encoding said
polypeptide, variant or fragment, an HER3 agent, or a combination thereof, and a
pharmaceutically acceptable carrier. Methods for formulating the HER3 compounds of
the invention for pharmaceutical administration are known to those of skill in the art.
See, for example, Remington: The Science and Practice of Pharmacy, 19th Edition,
Gennaro (ed.) 1995, Mack Publishing Company, Easton, PA.

As used herein the language "pharmaceutically acceptable carrier" is intended to 
include any and all solvents, dispersion media, coatings, antibacterial and antifungal 
agents, isotonic and absorption delaying agents, and the like, compatible with 
pharmaceutical administration. The use of such media and agents for pharmaceutically
active substances is well known in the art. Except insofar as any conventional media or
agent is incompatible with the active compound, such media can be used in the
compositions of the invention. Supplementary active compounds can also be
incorporated into the compositions. A pharmaceutical composition of the invention is
formulated to be compatible with its intended route of administration. 

The pharmaceutical compositions of the invention, comprising HER3
polypeptides, HER3 polypeptide variants, HER3 polypeptide fragments, polynucleotides
encoding said HER3 polypeptides, variants and fragments, as well as HER3 agents, as 
defined above, are administered in therapeutically effective amounts. The
"therapeutically effective amount" refers to a nontoxic dosage level sufficient to induce a
desired biological result (e.g. a diminution of the severity of the symptoms associated 
with a pathological condition such as breast or ovarian cancer). Amounts for 
administration may vary based upon the desired activity, the diseased state of the
mammal being treated, the dosage form, method of administration, and patient factors such
as age, sex, and severity of disease. It is recognized that a therapeutically effective
amount is provided in a broad range of concentrations. Such range can be determined 
based on in vitro and/or in vivo assays. 

Therapeutic compositions of the HER3 can be prepared by mixing the desired
HER3 molecule having the appropriate degree of purity with optional pharmaceutically
acceptable carriers, excipients, or stabilizers (Remington's Pharmaceutical Sciences, 16th
edition, Osol, A. ed. (1980)), in the form of lyophilized formulations, aqueous solutions 
or aqueous suspensions. Acceptable carriers, excipients, or stabilizers are preferably 
nontoxic to recipients at the dosages and concentrations employed, and include buffers
such as Tris, HEPES, PIPES, phosphate, citrate, and other organic acids; antioxidants
including ascorbic acid and methionine; preservatives (such as octadecyldimethylbenzyl 
ammonium chloride; hexamethonium chloride; benzalkonium chloride, benzethonium 
chloride; phenol, butyl or benzyl alcohol; alkyl parabens such as methyl or propyl 
paraben; catechol; resorcinol; cyclohexanol; 3-pentanol; and m-cresol); low molecular
weight (less than about 10 residues) polypeptides; proteins, such as serum albumin,
gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino 
acids such as glycine, glutamine, asparagine, histidine, arginine, or lysine; 
monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or 
dextrins; sugars such as sucrose, mannitol, trehalose or sorbitol; salt-forming counter-
ions such as sodium; and/or non-ionic surfactants such as TWEEN™, PLURONICS™
or polyethylene glycol (PEG). 

Additional examples of such carriers include ion exchangers, alumina, aluminum 
stearate, lecithin, serum proteins, such as human serum albumin, buffer substances such
as glycine, sorbic acid, potassium sorbate, partial glyceride mixtures of saturated
vegetable fatty acids, water, salts, or electrolytes such as protamine sulfate, disodium
hydrogen phosphate, potassium hydrogen phosphate, sodium chloride, colloidal silica,
magnesium trisilicate, polyvinyl pyrrolidone, and cellulose-based substances. Carriers for
topical or gel-based forms include polysaccharides such as sodium
carboxymethylcellulose or methylcellulose, polyvinylpyrrolidone, polyacrylates, 
polyoxyethylene-polyoxypropylene-block polymers, polyethylene glycol, and wood wax 
alcohols. For all administrations, conventional depot forms are suitably used. Such 
forms include, for example, microcapsules, nano-capsules, liposomes, plasters, inhalation
forms, nose sprays, sublingual tablets, and sustained-release preparations.

Solutions or suspensions used for administering HER3 can include the following 
components: a sterile diluent such as water for injection, saline solution; fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial 
agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or
sodium bisulfite; chelating agents such as EDTA; buffers such as acetates, citrates or
phosphates; and agents for the adjustment of tonicity such as sodium chloride or
dextrose. In one embodiment, a pharmaceutical composition can be delivered via slow 
release formulation or matrix comprising HER3 protein or DNA constructs suitable for 
expression of HER3 protein into or around a site within the body. 

HER3 can also be administered in the form of a variety of sustained-release
preparations. For example, HER3 may be delivered to the lung for slow release via
encapsulation or carrier materials such as liposomes, or other drug "shells" such as
albumin (Albunex by Molecular Biosystems), sugars (Levovist by Schering), gelatins, or
lipids. Other suitable examples of sustained-release preparations for use with
polypeptides include semipermeable matrices of solid hydrophobic polymers
containing the protein, which matrices are in the form of shaped articles, e.g., films, or
microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (e.g.,
poly(2-hydroxyethyl-methacrylate) as described by Langer et al., J. Biomed. Mater. Res.,
15: 167-277 (1981) and Langer, Chem. Tech., 12: 98-105 (1982) or poly(vinylalcohol)),
polylactides (U.S. Patent No. 3,773,919, EP 58,481), copolymers of L-glutamic acid and
gamma ethyl-L-glutamate (Sidman et al., Biopolymers, 22: 547-556 (1983)), non-
degradable ethylene-vinyl acetate (Langer et al., supra), degradable lactic acid-glycolic acid
copolymers such as the Lupron Depot (injectable microspheres composed of lactic acid-
glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid
(EP 133,988).

The route of administration may vary depending on the desired effect and/or 
outcome. Generally for initiation of an HER3 mediated response, introduction of the
HER3 at or near the desired site of response is utilized. Alternatively additional routes of
administration, such as a systemic administration of HER3, may be employed. Examples
of routes of administration include parenteral, e.g., intravenous, intradermal,
intramuscular, subcutaneous, oral (e.g., inhalation), transdermal (topical), transmucosal
(e.g. a nasal spray), and rectal administration. The HER3 polypeptide may also be
administered by perfusion techniques, such as isolated tissue perfusion, to exert local
therapeutic effects. Solutions or suspensions used for parenteral, intradermal, or 
subcutaneous application can include the following components: a sterile diluent such as 
water for injection, saline solution; fixed oils, polyethylene glycols, glycerine, propylene 
glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or methyl
parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such as
EDTA; buffers such as acetates, citrates or phosphates; and agents for the adjustment of
tonicity such as sodium chloride or dextrose. Regimens of administration may vary. A 
single dose or multiple doses of the agent may be used. Such regimens can vary 
depending on the severity of the disease and the desired outcome. Following
administration of a HER3 polypeptide to the mammal, the mammal's physiological
condition can be monitored in various ways well known to the skilled practitioner familiar 
with the pathological condition to be treated (e.g. breast or ovarian cancer).

It is contemplated that yet additional therapies may be employed in the methods.
The one or more other therapies may include but are not limited to, administration of
radiation therapy, cytokine(s), growth inhibitory agent(s), chemotherapeutic agent(s),
cytotoxic agent(s), tyrosine kinase inhibitors, ras farnesyl transferase inhibitors, angiogenesis
inhibitors, and cyclin-dependent kinase inhibitors which are known in the art and defined
further with particularity in Section I above. In addition, therapies based on therapeutic
antibodies that target tumor antigens such as Rituxan™ or Herceptin™, as well as anti-
angiogenic antibodies such as anti-VEGF, may be employed.

Preparation and dosing schedules for chemotherapeutic agents may be used
according to manufacturers' instructions or as determined empirically by the skilled
practitioner. Preparation and dosing schedules for such chemotherapy are also described 
in Chemotherapy Service Ed., M.C. Perry, Williams & Wilkins, Baltimore, MD (1992). The 
chemotherapeutic agent may precede, or follow administration of, e.g. an antagonist, or 
may be given simultaneously therewith. The antagonist, for instance, may also be
combined with an anti-oestrogen compound such as tamoxifen or an anti-progesterone
such as onapristone (see, EP 616812) in dosages known for such molecules. 

G. ARTICLES OF MANUFACTURE 

In another embodiment of the invention, an article of manufacture containing
materials useful for the treatment of the disorders described above is provided. The
article of manufacture comprises a container and a label. Suitable containers include, for
example, bottles, vials, syringes, and test tubes. The containers may be formed from
a variety of materials such as glass or plastic. The container holds a composition which is
effective for treating the condition and may have a sterile access port (for example the
container may be an intravenous solution bag or a vial having a stopper pierceable by a
hypodermic injection needle). The active agents in the composition may comprise
antagonist(s) or agonist(s). The label on, or associated with, the container indicates that
the composition is used for treating the condition of choice. The article of manufacture
may further comprise a second container comprising a pharmaceutically-acceptable
buffer, such as phosphate-buffered saline, Ringer's solution and dextrose solution. It
may further include other materials desirable from a commercial and user standpoint,
including other buffers, diluents, filters, needles, syringes, and package inserts with
instructions for use.

H. BRIEF CHARACTERIZATION OF ASPECTS OF THE INVENTION 

As disclosed herein, the predominant ligand binding site for heregulin is located
in domain I of HER3. The disclosure identifies two proteolytic sites in HER3-ECD I-IV
at positions 50 and 270. Cleavage at position 50 is fully protected by hrg binding,
whereas position 270 is unprotected. Hrg protection at position 50 is a result of the
specific interaction between hrg and HER3-ECD I-IV. Only two fragments were detected
by SDS-PAGE and mass spectrometry. The N-terminal fragment generated from
cleavage at residue 50 could not be detected, but its absence could be due to additional
cleavage sites between residues 20 and 50. The cleavage site at position 270 is not
protected by hrg, but the fragments generated by this cleavage are held together by
disulfide bonds. Hrg protects only one of the cleavage sites in a ligand-specific manner.
The extreme C terminus is also sensitive to proteolysis, which could be blocked by an
antibody against the V-5 epitope tag but not by hrg.

This protection at position 50 can be explained in two different ways: direct
binding of hrg at or near position 50, or indirect protection resulting from a
conformational change in HER3-ECD I-IV induced by hrg binding. To distinguish
between these two possibilities raised by the proteolysis experiments, we expressed
domains I and II of HER3-ECD I-IV (HER3-ECD I-II) and assayed the construct for
binding.

As illustrated in the Examples below, the HER3-ECD I-II polypeptide is able to bind
heregulin, and the association state and binding properties of HER3-ECD I-II were
characterized. For example, HER3-ECD I-II is shown to be a monomer and remains in a
monomeric state when hrg is present. This is in contrast to the HER3-ECD I-IV
expressed in S2 cells, in which hrg reverses oligomerization to form a monomer (see, e.g.
Landgraf, R., and Eisenberg, D. (2000) Biochemistry 39, 8503-8511). As discussed
below, we analyzed direct hrg binding to HER3-ECD I-II and qualitatively demonstrated
specific binding to hrg by a pull-down assay. An equilibrium dissociation
constant of 68 nM for hrg binding was calculated by surface plasmon resonance in which
the trx-hrg was immobilized. HER3-ECD I-II has 30-fold lower binding affinity than
HER3-ECD I-IV (2.3 nM) but a 7-fold greater affinity for its ligand than the extracellular
domain of EGFR (500 nM) (see, e.g. Kohda et al., (1993) J. Biol. Chem. 268, 1976-1981).
Consequently, the results demonstrate that domains I and II of HER3-ECD are sufficient
for hrg binding, and the results from proteolysis protection provide evidence that a
ligand binding site is located in domain I.
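
The fold differences quoted above follow from simple ratios of the dissociation
constants. The following is a minimal sketch of that arithmetic using the Kd values stated
in this paragraph; the function and variable names are illustrative assumptions and are not
part of the disclosure.

```python
def fold_difference(kd_weaker_nM: float, kd_tighter_nM: float) -> float:
    """Ratio of two dissociation constants (weaker binder over tighter binder).

    A larger Kd means lower affinity, so the ratio says how many times
    weaker the first interaction is than the second.
    """
    if kd_weaker_nM <= 0 or kd_tighter_nM <= 0:
        raise ValueError("dissociation constants must be positive")
    return kd_weaker_nM / kd_tighter_nM


# Kd values as stated in the text (nM).
KD_HER3_ECD_I_II = 68.0
KD_HER3_ECD_I_IV = 2.3
KD_EGFR_ECD = 500.0

loss_vs_full_ecd = fold_difference(KD_HER3_ECD_I_II, KD_HER3_ECD_I_IV)
gain_vs_egfr = fold_difference(KD_EGFR_ECD, KD_HER3_ECD_I_II)

print(f"HER3-ECD I-II vs HER3-ECD I-IV: {loss_vs_full_ecd:.1f}-fold lower affinity")
print(f"HER3-ECD I-II vs EGFR ECD: {gain_vs_egfr:.1f}-fold higher affinity")
```

Rounded to whole numbers, the two ratios correspond to the 30-fold and 7-fold figures
given in the text.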

Limited proteolysis of HER3-ECD I-IV and the expression of HER3-ECD I-II
provide evidence that the extent to which different domains of the type I receptor
tyrosine kinases (EGFR, HER2, HER3, and HER4) contribute to ligand binding may
not be conserved among the members of this EGFR family. The data provide evidence
that domain I of HER3-ECD I-IV contains a site involved in ligand binding, whereas in
EGFR, multiple lines of evidence indicate that the ligand contacts both domains
I and III (see, e.g. Lax et al., (1989) EMBO J. 8, 421-427; Lax et al., (1991) Cell Regul. 2,
337-345; Wu et al., (1989) J. Biol. Chem. 264, 17469-17475; Summerfield et al., (1996) J.
Biol. Chem. 271, 19656-19659; Wu et al., (1990) Proc. Natl. Acad. Sci. U. S. A. 87, 3151-
3155) but that domain III alone is sufficient for ligand binding (see, e.g. Kohda et al.,
(1993) J. Biol. Chem. 268, 1976-1981). It is noteworthy that the Kd for transforming
growth factor binding to the monomeric ECD of EGFR is 500 nM. Domain III of
EGFR binds transforming growth factor with a Kd of 1.3 uM. The Kd of HER3-ECD I-IV
is 1.9 nM, whereas the Kd for HER3-ECD I-II is 68 nM. Therefore, despite a loss of
binding compared with HER3-ECD I-IV, HER3-ECD I-II still retains a higher affinity for
its ligand than the EGFR ECD. EGFR forms dimers in response to ligand binding (see,
e.g. Hurwitz et al., (1991) J. Biol. Chem. 266, 22035-22043), whereas HER3-ECD I-IV
expressed in S2 cells exists as an oligomer but, following ligand binding, forms a
monomer (see, e.g. Landgraf, R., and Eisenberg, D. (2000) Biochemistry 39, 8503-8511).
This may be a reflection of the different modes of ligand binding in the two
receptors. The data provide evidence that for HER3-ECD I-IV ligand binding occurs
partly in domain I, whereas domain III may also contribute to binding; the relative
contribution of domains I and II to binding appears to be shifted in favor of domain I
for HER3.

An understanding of the activity of type I RTKs has been hindered by the lack of
a molecular structure for any member of the family. It has been proposed that the
structure of type I RTK ECDs may be similar to the insulin-like growth factor-1 receptor
(IGF-1R) (see, e.g. Jorissen et al., (2000) Protein Sci. 9, 310-324; Garrett et al., (1998)
Nature 394, 395-399). The structure of the first three domains of IGF-1R was solved by
x-ray crystallography at 2.6-Å resolution (see, e.g. Garrett et al., (1998) Nature 394, 395-
399). IGF-1R is a type II receptor tyrosine kinase and is a member of the tyrosine kinase
superfamily, which includes the type I (EGFR/HER3) subfamily (see, e.g. Ullrich, A.,
and Schlessinger, J. (1990) Cell 61, 203-212). HER3 and IGF-1R have significant
sequence identity in portions of the extracellular domain and have a similar domain
organization (see, e.g. Yarden, Y., and Ullrich, A. (1988) Annu. Rev. Biochem. 57, 443-
478; Lax et al., (1988) Mol. Cell. Biol. 8, 1970-1978). In addition, this equivalence has
been proposed based on homology modeling between HER3-ECD I-IV and the IGF-1R
structure (see, e.g. Jorissen et al., (2000) Protein Sci. 9, 310-324). The IGF-1R structure
contains three domains (L1, S1, and L2), which are equivalent to domains I, II, and III in
HER3-ECD I-IV (Table I). Domains L1 and L2 are similar in sequence in IGF-1R (25%
identity and 41% similarity), as are domains I and III in HER3-ECD I-IV (30% identity
and 41% similarity). Domains L1 and L2 also have highly similar structures (see, e.g.
Garrett et al., (1998) Nature 394, 395-399). Alanine-scanning mutagenesis of the insulin
receptor, a receptor closely related to IGF-1R, identified four regions in the primary
sequence of L1 that were important for ligand binding (see, e.g. Williams et al., (1995) J.
Biol. Chem. 270, 3012-3016). These regions map to the IGF-1R structure, which reveals
a "hormone binding footprint" lying on the face of domain L1 (see, e.g. Garrett et al.,
(1998) Nature 394, 395-399). A naturally occurring mutation in the insulin receptor at
residue 58 lies within the footprint (see, e.g. van der Vorm et al., (1992) J. Biol. Chem.
267, 66-71). Other mutations in the insulin receptor also provide evidence of minor
contributions from domain L2 (see, e.g. Nakae et al., (1995) J. Biol. Chem. 270, 22017-
22022). A model has been suggested in which ligand binding in the IGF-1R involves
domains L1 and L2 (see, e.g. Jorissen et al., (2000) Protein Sci. 9, 310-324), which
correspond to domains I and III in HER3-ECD I-IV.
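
The percent-identity figures quoted above for the L1/L2 and I/III domain pairs are
computed over aligned positions. A minimal sketch of such a calculation is given below. The
function name, the gap convention (columns with a gap in either sequence are skipped), and
the toy sequences are assumptions made for illustration; they are not the GCG
implementation and not the actual receptor sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == gap or res_b == gap:
            continue  # skip gapped columns (an assumed convention)
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


# Toy aligned sequences (hypothetical, for illustration only).
toy_a = "ACDE-FGHIK"
toy_b = "ACDQWFG-IR"
print(f"{percent_identity(toy_a, toy_b):.1f}% identity")
```

Other conventions (for example, dividing by the length of the shorter sequence) give
different values, so identity figures are only comparable when the same convention is used.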

Our current analysis provides biochemical evidence for the proposed structural
similarity of type I (HER3) and II (IGF-1R and IR) RTKs. First, a sequence alignment
of domain L1 in the IR and the IGF-1R and domain I in HER3-ECD I-IV was performed
(Fig. 8). The cleavage site at position 50, which is protected by hrg, corresponds to the
putative ligand-binding region in domain L1 in IGF-1R (Fig. 9A). Residue 17 in IGF-1R
(position 50 in HER3-ECD I-IV) lies on the same face as the binding footprint and is 11 Å
away from residue 10, which lies within the binding footprint. Second, we expressed
HER3-ECD I-II, which binds hrg with a 68 nM dissociation constant, supporting the
involvement of domain I in ligand binding. Third, the second proteolysis site, at
position 270, where the fragments are held together by a disulfide bridge, is located in the
cysteine-rich domain II of HER3-ECD I-IV (position 237 in the IGF-1R). Based on the
IGF-1R structure, cleavage at this position is expected to result in a disulfide-linked
fragment (Fig. 9B). Fourth, the large accessible surface area (1782 Å²) of domain L1 that
is buried by domain S1 of IGF-1R and the conserved contacts in this interface in the
type I RTKs (see, e.g. Jorissen et al., (2000) Protein Sci. 9, 310-324) provide evidence
that these domains function as a unit. This is consistent with our finding that the
expression of domain I of HER3 requires the presence of domain II. Therefore, our
results are consistent with the proposal that HER3 has a structure similar to IGF-1R.
These data provide further insight into areas in the HER3 receptor critical for heregulin
binding.

The following examples are offered by way of illustration and not by way of 
limitation. Various citations are referenced throughout the specification (e.g. PCT
application No. WO0160397A1). The disclosures of all citations in the specification are 
expressly incorporated herein by reference. 

EXAMPLES 

Example 1: Common Materials

Soluble extracellular domains of recombinant HER3 (HER3-ECD I-IV) were
purified from Chinese hamster ovary cells by the procedure of Kita et al. (see, e.g. Kita et
al., (1996) Biochem. Biophys. Res. Commun. 226, 59-69). The 60-residue EGF-like
domain of human heregulin β1 (residues 177-237) (hrg), the binding-deficient heregulin
mutant (NAs-hrg), and the thioredoxin-heregulin fusion with a C-terminal His and S-tag
(trx-hrg) were generated and purified as described elsewhere (see, e.g. Landgraf et al.,
(1998) Biochemistry 37, 3220-3228). EGF was purchased from Sigma. Trypsin was
purchased from Life Technologies, Inc. (catalog number 15400-013 and lot numbers
12P6334 and 1023241) and Sigma (catalog number T-1426 and lot number 70K7661),
which had been treated with tosylamide-2-phenylethylchloromethyl ketone to remove
chymotrypsin.

Example 2: Proteolytic Examination of Heregulin and HER3 Interaction

A. Proteolytic Digestion. 

5.0 ul of HER3-ECD I-IV at 2.3 mg/ml in storage buffer (20 mM sodium acetate
pH 5.5 and 40 mM sodium chloride), 10 ul of phosphate-buffered saline (PBS: 136 mM
NaCl, 2.68 mM KCl, 9.289 mM Na2HPO4, 1.969 mM KH2PO4), and 3.0 ul of trypsin
(Life Technologies, Inc.; catalog number 15400-013 and lot number 12P6334) at 5.0
mg/ml in 5.0 mM EDTA and 150 mM sodium chloride were incubated at 37 °C for 1 h.
Digestion was stopped by adding phenylmethylsulfonyl fluoride to a final concentration
of 1.0 mM. 5.0 ul of the digested sample were added to 5.0 ul of 2x SDS-PAGE sample
buffer with and without 1.0 mM β-mercaptoethanol (β-ME). These samples were then
analyzed by SDS-PAGE on a 10-15% Phast gel (Amersham Pharmacia Biotech) and
visualized by Coomassie staining. Different sources of trypsin were compared, but the
best results were obtained with tissue culture grade trypsin from Life Technologies, Inc.,
which may contain other proteases.

HER3-ECD I-IV was titrated with hrg to determine the minimum amount required
for complete protection of HER3-ECD I-IV from proteolysis. Hrg in PBS was incubated
with HER3-ECD I-IV for 30 min at room temperature, followed by trypsin as described
above. The final concentration of hrg in the digest ranged from 1.7 to 17 uM with a
constant HER3-ECD I-IV concentration of 7.7 uM. These reaction mixtures were
analyzed by reducing SDS-PAGE as described above. The proteolytic digest was
repeated with NAs-hrg and EGF using a molar ratio of 1:1.
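
The titration range given above brackets a 1:1 molar ratio of hrg to receptor. The
following minimal sketch makes that explicit using the concentrations stated in the text;
the function and variable names are illustrative assumptions.

```python
HER3_ECD_UM = 7.7           # constant receptor concentration in the digest (uM)
HRG_RANGE_UM = (1.7, 17.0)  # lowest and highest hrg concentrations (uM)


def molar_ratio(ligand_uM: float, receptor_uM: float) -> float:
    """Moles of ligand per mole of receptor at the given concentrations."""
    if receptor_uM <= 0:
        raise ValueError("receptor concentration must be positive")
    return ligand_uM / receptor_uM


low_ratio = molar_ratio(HRG_RANGE_UM[0], HER3_ECD_UM)
high_ratio = molar_ratio(HRG_RANGE_UM[1], HER3_ECD_UM)
print(f"hrg:HER3-ECD I-IV ratio spans {low_ratio:.2f}:1 to {high_ratio:.2f}:1")
```

The range therefore runs from a sub-stoichiometric amount of hrg to roughly a twofold
excess, so a 1:1 ratio corresponds to 7.7 uM hrg in this digest.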

B. Proteolysis of HER3-ECD I-IV Results in Two Primary Fragments That Remain
Associated under Oxidizing Conditions.

First, we digested HER3-ECD I-IV to learn if limited proteolysis leads to a
fragment still capable of binding hrg. Digestion results in two principal fragments (Fig. 1A,
lane 3) with apparent molecular masses of 24 kDa (fragment 1) and 54 kDa (fragment 2),
compared with an apparent molecular mass for the nonproteolyzed HER3-ECD I-IV of
80 kDa. Gel electrophoresis of the proteolyzed HER3-ECD I-IV in the absence of a
reducing agent shows that the two fragments remain linked by disulfide bonds (Fig. 1A,
lane 4). Fragment 2 was identified as the C-terminal fragment by Western blot using the
V5 C-terminal epitope-tagged HER3-ECD I-IV (Fig. 1B).

C. Proteolytic Cleavage of HER3-ECD I-IV Abolishes Heregulin Binding

To determine the effects of cleavage of HER3-ECD I-IV on its ability to bind hrg,
a gel mobility shift assay was performed. In a native gel analysis, proteolyzed
HER3-ECD I-IV (Fig. 2A, lane 2) shows an electrophoretic mobility similar to the
nonproteolyzed HER3-ECD I-IV (Fig. 2A, lane 1). The nonproteolyzed HER3-ECD I-IV
shows a discrete shift in the presence of a 1:1 molar ratio of hrg to HER3-ECD I-IV (Fig.
2A, lane 3). The proteolyzed HER3-ECD I-IV does not shift its gel position in the
presence of hrg (Fig. 2A, lane 4). This finding provides evidence that the two disulfide-
linked fragments generated by proteolytic digestion (Fig. 1A, lane 4) do not bind hrg.

To check whether the proteolyzed HER3-ECD I-IV binds hrg, we used a pull-
down assay under physiological salt conditions. In this analysis, we measured the
HER3-ECD I-IV and proteolyzed HER3-ECD I-IV that could be dissociated from immobilized
thioredoxin-heregulin (trx-hrg) by the addition of hrg. Using trx-hrg-coupled S-protein
agarose, HER3-ECD I-IV was eluted with hrg (Fig. 2B, lane 3). No proteolyzed
HER3-ECD I-IV could be recovered by either elution with hrg (Fig. 2B, lane 4) or detergent
(Fig. 2B, lane 7). However, HER3-ECD I-IV that was protected by hrg during proteolytic
digestion could be recovered by detergent and competition with hrg (Fig. 2B, lanes 5 and
8). In addition, a trypsin preparation that was treated with phenylmethylsulfonyl fluoride
prior to the addition to the digestion reaction did not abolish hrg binding by
HER3-ECD I-IV (Fig. 2B, lane 9). Based on these results, we conclude that the proteolyzed
HER3-ECD I-IV does not bind hrg.

D. Heregulin Protects HER3-ECD I-IV from Proteolytic Cleavage

The observation that proteolytic cleavage destroys the binding of HER3-ECD I-IV
to hrg provides evidence that cleavage occurs at or near the binding site. This led us to
investigate the proteolysis of HER3-ECD I-IV in the presence of hrg. The digestion of
HER3-ECD I-IV in the presence of a molar excess of hrg (Fig. 3A, lane 3) produces a
different cleavage pattern than in the absence of hrg (Fig. 3A, lane 2). Complete
protection of HER3-ECD I-IV was observed (Fig. 3B, lane 3) at an approximate 1:1 molar
ratio of hrg to HER3-ECD I-IV. Fragment 1 has a higher molecular weight in the
presence of hrg (Fig. 3A, lane 3) than in its absence (Fig. 3A, lane 2), whereas the size of
fragment 2 is apparently not affected by the presence of hrg (Fig. 3A, lanes 2-5). The
cleavage that produces fragment 1 is apparently blocked by hrg; when it occurs, it
produces an additional, smaller fragment, which was not detected in the SDS gel of the
digest of HER3-ECD I-IV. These data provide evidence that hrg protects HER3-ECD I-IV
from the proteolytic cleavage that generates fragment 1.


E. Localization of the Cleavage Sites 

Taken together, these results provide evidence that there are at least two cleavage
sites, one that is protected by hrg and another that is not. To localize the positions of the
cleavage sites, we used N-terminal sequencing and mass spectrometry. Fragment 1 begins
at position 51 in the absence of hrg and at position 20 (the first residue in native
HER3-ECD I-IV) in the presence of hrg (Fig. 4). The N-terminal 31-residue fragment
generated from the cleavage following residue 50 was not detected by SDS-PAGE or by mass
spectrometry. The corresponding size by mass spectrometry of fragment 1 is 26.7 kDa in
the absence of hrg and 30 kDa in the presence (Fig. 4). The difference in mass of
fragment 1 in the presence and absence of hrg is 3270 daltons, which corresponds to the
predicted size of 3216 daltons, based on the amino acid sequence of the missing
fragment (Fig. 4). This provides evidence that the amino acid residues between positions
20 and 50 of HER3-ECD I-IV are not glycosylated. Based on these results, we conclude
that hrg protects HER3-ECD I-IV from proteolysis at position 50.
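
The argument above rests on how closely the observed mass difference matches the
mass predicted from sequence. The following minimal sketch quantifies that agreement using
the two values stated in the text; the function name is an illustrative assumption, and the
sketch does not say what deviation a glycan would produce.

```python
from typing import Tuple

OBSERVED_DIFFERENCE_DA = 3270.0  # fragment 1 mass with hrg minus mass without hrg
PREDICTED_PEPTIDE_DA = 3216.0    # predicted from the sequence of the missing fragment


def mass_deviation(observed_da: float, predicted_da: float) -> Tuple[float, float]:
    """Return (absolute deviation in Da, relative deviation in percent)."""
    if predicted_da <= 0:
        raise ValueError("predicted mass must be positive")
    delta = observed_da - predicted_da
    return delta, 100.0 * delta / predicted_da


delta_da, delta_percent = mass_deviation(OBSERVED_DIFFERENCE_DA, PREDICTED_PEPTIDE_DA)
print(f"deviation: {delta_da:.0f} Da ({delta_percent:.2f}% of the predicted mass)")
```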

The cleavage that generates fragment 2 was not blocked by hrg in our assays and
is located at position 270. The mass of fragment 2 is 47.0 kDa both in the presence and
absence of hrg. The C terminus may contain additional cleavage sites that are not
protected by hrg, because there is a missing C-terminal fragment (7.5 kDa) both in the
presence and absence of hrg (Fig. 4). However, cleavage of the C terminus could be
blocked by an antibody against the V5 epitope tag (Fig. 1B).

The cleavage sites on the carboxyl side of Tyr 50 and Phe 270 are more typical of
chymotryptic rather than tryptic cleavage. This cleavage could be due to chymotrypsin
present in the solution purchased from Life Technologies, Inc. However, the identity of
the specific protease involved does not affect the finding that hrg protects
HER3-ECD I-IV from cleavage at Tyr 50 but not at Phe 270.
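
The reasoning above can be illustrated by comparing the residue preceding each
cleavage site with the usual preferences of the two proteases. The following is a minimal
sketch under simplified textbook assumptions (trypsin cleaving after Lys or Arg,
chymotrypsin after Phe, Tyr, or Trp); the names are illustrative, and real specificity
also depends on neighboring residues and accessibility.

```python
# Simplified preferences for the residue on whose carboxyl side cleavage occurs.
# These are textbook generalizations, not an exhaustive specificity model.
TRYPSIN_P1 = {"K", "R"}            # Lys, Arg
CHYMOTRYPSIN_P1 = {"F", "Y", "W"}  # Phe, Tyr, Trp


def likely_protease(p1_residue: str) -> str:
    """Classify a cleavage site by the one-letter code of its preceding residue."""
    residue = p1_residue.upper()
    if residue in TRYPSIN_P1:
        return "trypsin-like"
    if residue in CHYMOTRYPSIN_P1:
        return "chymotrypsin-like"
    return "unassigned"


# Cleavage sites reported in the text: Tyr 50 and Phe 270.
for position, residue in ((50, "Y"), (270, "F")):
    print(position, residue, likely_protease(residue))
```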

F. Protection Is Due to a Specific Interaction of hrg with HER3-ECD I-IV

To show that hrg protection is a result of the specific interaction between hrg
and HER3-ECD I-IV, we repeated the digestion in the presence of a binding-deficient
mutant of hrg (NAs-hrg) and EGF. NAs-hrg is deficient in binding because
the N-terminal residues that confer specific HER3-ECD I-IV binding were mutated to
alanine. NAs-hrg and EGF do not shift the position of HER3-ECD I-IV in a gel mobility
shift assay (Fig. 5, lanes 3 and 4). The cleavage pattern obtained in the presence of NAs-
hrg and EGF matches that of unprotected HER3-ECD I-IV (Fig. 3A, lanes 4 and 5 versus
lane 2), further confirming that they do not bind HER3-ECD I-IV. Therefore, the
difference in the cleavage pattern in the presence of hrg is due to a specific interaction
between hrg and HER3-ECD I-IV.

Example 3: Heregulin Binding Assays

A. Gel Mobility Shift Assays 

HER3-ECD I-IV (1.4 ul) in storage buffer was incubated with 9.6 ul of hrg (6.6 uM
in PBS) for 30 min at room temperature. 4.0-ul aliquots of this mixture were run on a 10-
15% Phast gel under native conditions. This assay was repeated for NAs-hrg, EGF, and
proteolyzed HER3-ECD I-IV under the same conditions.

B. "Pull-down" Assay Using Immobilized Trx-hrg 

The ability of proteolyzed HER3-ECD I-IV to bind hrg was analyzed by a pull-
down assay. S-protein agarose (0.5 ml) (Novagen) was spun down and resuspended three
times in PBS. Aliquots of 100 ul were spun again, and trx-hrg (600 nM) and bovine
serum albumin (BSA) (1 mg/ml) were added to the resin. After incubating 15 min at 4
°C, the resin was washed five times with PBS, and blocked again with BSA. The
following samples were added to each aliquot of resin: HER3-ECD I-IV, proteolyzed
HER3-ECD I-IV, HER3-ECD I-IV protected by hrg during proteolysis, and HER3-ECD I-IV
incubated with trypsin deactivated with 1 mM phenylmethylsulfonyl fluoride prior to the
proteolysis reaction. HER3-ECD I-IV was at 600 nM. BSA (1 mg/ml) was also added to
each aliquot. Following a 15-min incubation at 4 °C, the resin was spun down and
resuspended three times in PBS. Each aliquot of resin was then resuspended in PBS
containing 1 uM hrg or in PBS alone. After a 1-min incubation at 4 °C, the samples were spun
down, and the supernatant was diluted with 2x SDS-PAGE sample buffer containing 1.0
mM β-ME. Each aliquot of resin was then resuspended in SDS-PAGE sample buffer.
The samples were analyzed by SDS-PAGE on a 4-15% polyacrylamide gradient gel (Bio-
Rad) and visualized by Western blotting and chemiluminescence using a polyclonal
antibody directed against the HER3-ECD (E38530; Transduction Laboratories,
Lexington, KY) followed by incubation with a secondary antibody conjugated to
horseradish peroxidase (Invitrogen).

The ability of HER3-ECD I-II to bind hrg was also evaluated in the pull-down
assay in the same way, with the following exceptions. HER3-ECD I-II (600 nM) was
added to the trx-hrg and control resins, and the individual samples were resuspended in
the various competing ligands indicated in Fig. 5. HER3-ECD I-II was visualized by
Western blotting and chemiluminescence using a monoclonal antibody directed against
the V-5 epitope conjugated to horseradish peroxidase (Invitrogen).

Example 4: Identification of the Digested Fragments and Amino Acid Analysis 

Proteolytically digested HER3-ECD I-IV was diluted with sample buffer
containing β-ME and run on a 4-15% polyacrylamide gradient gel (Bio-Rad) as described
above. The samples were then transferred to a polyvinylidene difluoride membrane, and
the fragments were visualized by Ponceau stain. Edman sequencing of excised bands was
performed in the UCLA Protein Microsequencing Facility using a Porton-Beckman Gas
Phase Sequencer, and the first 10 residues of each fragment were identified.
HER3-ECD I-IV was proteolyzed in the presence of an equimolar amount of an antibody against
the C-terminal V5 epitope for 30, 60, and 120 min. The C-terminal fragment was
identified by Western blotting using an antibody against the C-terminal V5 epitope
(Invitrogen).

The molar concentrations of HER3-ECD I-IV, hrg, and
NAs-hrg were determined by amino acid analysis at the UCLA Protein Microsequencing
Facility. The samples were hydrolyzed in 6 N HCl at 110 °C for 18 h under a vacuum in
a nitrogen atmosphere. The hydrolysate was derivatized with phenylisothiocyanate. The
phenylisothiocyanate amino acids were analyzed on a reverse phase column (Novapak)
using a sodium acetate-acetonitrile gradient (see, e.g. Bidlingmeyer et al., (1984) J.
Chromatogr. 336, 93-104; Cohen, S. A., and Strydom, D. J. (1988) Anal. Biochem. 174,
1-16).

Example 5: Matrix-assisted Laser Desorption Ionization (MALDI) Mass Spectrometry

We used MALDI time-of-flight mass spectrometry to measure the molecular
mass of proteolyzed HER3-ECD I-IV and HER3-ECD I-II. The mass spectrometry was
performed on a Voyager RP machine (PerSeptive Biosystems, Framingham,
Massachusetts) with BSA as an internal standard. 0.3 ul of HER3-ECD I-IV (2.3 mg/ml)
in 20 mM sodium acetate pH 5.5, 40 mM sodium chloride was mixed with 0.5 ul of 10
mg/ml sinapinic acid in 70% acetonitrile, 0.1% trifluoroacetate and dried on the MALDI
plate. The digests (plus and minus hrg) of HER3-ECD I-IV were prepared in the same
way, except that the sample was taken directly from the digest reaction after 1 h of
incubation time. The digests were run under oxidizing and reducing conditions. The
digests were reduced by the addition of 1 mM dithiothreitol prior to mixing the sample
with the matrix. The molecular mass of HER3-ECD I-II (12 uM) in PBS was measured
under the same conditions.

Example 6: Sequence Alignments and Surface Area Calculations

An initial multiple sequence alignment was performed with the GCG program
Pileup (Wisconsin Package, SeqLab, SeqWeb) using the BLOSUM62 matrix (see, e.g.
Henikoff, S., and Henikoff, J. G. (1992) Proc. Natl. Acad. Sci. U. S. A. 89, 10915-10919).
Empirical observations of the N terminus were used in conjunction with the initial
alignment to produce a final alignment. We used the GCG program Bestfit (Wisconsin
Package, SeqLab, SeqWeb) (see, e.g. Smith, T., and Waterman, M. (1981) Adv. Appl.
Math. 2, 482-489) to find the segment of greatest similarity between two sequences. The
amount of solvent-exposed surface area of domain L1 that is buried by domain S1 of
IGF-1R was calculated using the CCP4 program Areaimol (see, e.g. Lee, B., and
Richards, F. M. (1971) J. Mol. Biol. 55, 379-400; Collaborative Computational Project.
(1994) Acta Crystallogr. Sect. D 50, 760-763).

Example 7: Cloning and Expression of HER3-ECD I-II

The HER3 cDNA was amplified from the pJTH3 plasmid provided by Amgen,
using the N-terminal primer (CTA GTC TCT AGA TCC GAG GTG GGC AAC TCT)
(SEQ ID NO: 7) and C-terminal primer (TAG CGA TCT AGA TTT CGG ACA GAG
ACC CCC) (SEQ ID NO: 8). Following amplification, HER3-ECD I-II (residues 20-329)
was cloned into the BglII and XbaI sites of the pMT/BiP/V5-His-A expression vector
(Invitrogen, Carlsbad, CA). This vector contains an N-terminal Drosophila leader
sequence and C-terminal His and V5 epitope tags. S2 cells were co-transfected with
the vector containing HER3-ECD I-II and the pCoHYGRO vector (Invitrogen), which
provides hygromycin resistance. We obtained a stable cell line after 3 weeks of selection
with 300 ug/ml hygromycin (Invitrogen). Five liters of S2 medium (Sigma) with 1% fetal
bovine serum were inoculated with S2 cells and grown to a cell density of 5 × 10^6 in a
spinner flask (Bellco) and then induced for 3 days at room temperature with 500 uM
CuSO4. HER3-ECD I-II was secreted to the medium. We used an ammonium sulfate cut
(80%, w/v) to precipitate the protein from the medium. The pellet was resolubilized in
20 mM Tris, pH 7.9, 1.5 M NaCl, and 0.1% Tween 20. The solution was dialyzed into 20
mM Tris, pH 7.9, 0.5 M NaCl, and 5 mM imidazole and then purified on a 5-ml
"HITRAP Chelating" column (Amersham Pharmacia Biotech) loaded with NiSO4.
Following elution from the column, the protein was
dialyzed into 20 mM sodium potassium phosphate, pH 10. The protein was loaded onto
an anion exchange column (Bio-Rad), and pure HER3-ECD I-II was collected from the
flow-through.

Example 8: Multiangle Light Scattering of HER3-ECD I-II

Size exclusion chromatography was performed using a TosoHaas G3000SWXL
column, followed by light scattering on a miniDAWN three-angle light-scattering
instrument (Wyatt Technologies, Santa Barbara, CA). Data analysis was carried out using
ASTRA software. HER3-ECD I-II (12 uM) in 20 mM Hepes, pH 7.5, was injected onto
the column at a flow rate of 0.4 ml/min. The refractive index increment (dn/dc) for the
protein portion of the HER3-ECD I-II in this buffer was assumed to be equal to that of
monomeric BSA (Sigma) (dn/dc = 0.181 cm3/g). The carbohydrate dn/dc contribution
was estimated to be 0.157 cm3/g and was integrated into the final dn/dc (0.180 cm3/g)
proportionally. Calculated extinction coefficients uncorrected for folding effects were
used. The error due to noise in the light scattering data is about ±3%. An error in the
assumed extinction coefficient would result in a proportional error in the calculated
molecular mass.
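
The proportional combination of the protein and carbohydrate dn/dc values described
above can be read as a weight-fraction average. The following is a minimal sketch of that
reading; the function name and the 4% carbohydrate weight fraction are assumptions chosen
for illustration (the fraction actually used is not stated in this Example).

```python
DNDC_PROTEIN = 0.181       # cm3/g, BSA value assumed for the protein portion
DNDC_CARBOHYDRATE = 0.157  # cm3/g, estimate for the carbohydrate portion


def weighted_dndc(carbohydrate_weight_fraction: float) -> float:
    """Weight-fraction average of the protein and carbohydrate dn/dc values."""
    f = carbohydrate_weight_fraction
    if not 0.0 <= f <= 1.0:
        raise ValueError("weight fraction must lie between 0 and 1")
    return (1.0 - f) * DNDC_PROTEIN + f * DNDC_CARBOHYDRATE


# Hypothetical carbohydrate content of 4% by weight, chosen for illustration.
print(f"{weighted_dndc(0.04):.3f} cm3/g")
```

With the assumed 4% carbohydrate content the weighted value rounds to the 0.180 cm3/g
quoted above, and that assumed fraction lies within the 3.3 to 5.7% range reported for
HER3-ECD I-II in Example 11.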

Example 9: Ultracentrifugation of HER3-ECD I-II

Sedimentation equilibrium was performed at 4 °C in a Beckman Optima XL-A
analytical ultracentrifuge using absorption optics at 280 nm. A 12-mm path length six-
sector cell was used to measure protein samples at initial A280 values of 0.15, 0.35, and
0.75. All samples were in PBS. Sedimentation equilibrium profiles were measured at
12,000 and 15,000 rpm. The data were initially fitted with a nonlinear least-squares
exponential fit for a single ideal species using Origin (version 3.01). Since no
concentration or speed dependence of the molecular weight was apparent, the Beckman
global analysis software (the "multifit" option of the above mentioned software) was
used to analyze all six scans simultaneously. A partial specific volume of 0.712 calculated
from the amino acid composition and corrected to 4 °C was used (see, e.g. Cohn, E. J.,
and Edsall, J. T. (1943) in Proteins, Amino Acids and Peptides as Ions and Dipolar Ions
(Cohn, E. J., and Edsall, J. T., eds), pp. 370-381, Reinhold Publishing Corp., New York;
Laue et al., (1992) in Analytical Ultracentrifugation in Biochemistry and Polymer Science
(Harding, S. E., Rowe, A. J., and Horton, J. C., eds), pp. 90-125, The Royal Society of
Chemistry, Cambridge, United Kingdom).

Example 10: Surface Plasmon Resonance Measurement of Binding between
HER3-ECD I-II and the EGF-like Domain of Heregulin

Trx-hrg (13 uM) in MES buffer (100 mM pH 6.0) was immobilized on a BIAcore
CM5 chip using standard
N-hydroxysuccinimide/1-ethyl-3-(3-dimethylaminopropyl)carbodiimide amine-coupling
chemistry. The surface of the chip
was blocked with ethanolamine and could be regenerated with 5.0 M NaCl and washes
with running buffer (PBS + surfactant). HER3-ECD I-II (filtered through a 0.1-um filter
unit (Millipore, Bedford, MA)) in running buffer was applied to the chip at various
concentrations (in triplicate) to determine the dissociation constant. Data were analyzed
with the BIAevaluation software. Competition experiments were carried out using hrg at
an equimolar concentration to HER3-ECD I-II.

Example 11: Recombinant HER3-ECD I-II Is Sufficient for Heregulin Binding

To show that a ligand binding site is located within domain I of HER3-ECD I-IV,
we recombinantly expressed domain I of HER3 alone (HER3-ECD I) as well as domains
I and II together (HER3-ECD I-II). Expression of domain I alone resulted in an
improperly folded protein because of incorrect disulfide formation, so only
HER3-ECD I-II could be evaluated for ligand binding. The recombinant form of HER3-ECD I-II
(residues 20-329) was expressed in Drosophila S2 cells and purified from the medium. The
protein was found to be pure by SDS-PAGE analysis with an apparent molecular mass
of 40 kDa (Fig. 6A). We used MALDI mass spectrometry to estimate the carbohydrate
content. The molecular mass of HER3-ECD I-II was found to be between 38.5 and 40
kDa, compared with a theoretical molecular mass of 37.7 kDa (Fig. 6B). This provides
evidence that HER3-ECD I-II has a carbohydrate content between 3.3 and 5.7% by
weight.

Size exclusion chromatography followed by multiangle light scattering was used
to show that recombinant HER3-ECD I-II is a monomer. The concentration of
HER3-ECD I-II at the time of light scattering was 1.8 uM. At this concentration, a single peak
was observed with a molecular mass of ~34 kDa (S.E. = ±4%) corresponding to
monomeric HER3-ECD I-II (Fig. 6C). There was a slight shift in the peak upon the
addition of a 1.3-fold molar excess of hrg. The shifted peak has an estimated molecular
mass of 35 kDa (S.E. = ±3%) (Fig. 6C). To confirm that HER3-ECD I-II is a monomer,
we used ultracentrifugation at three different concentrations and three different speeds.
The estimated average molecular weight of HER3-ECD I-II, using a group analysis of all
of the scans, is 35,000 ± 210 daltons (Fig. 6D). This is consistent with the expected
molecular mass (37.5 kDa) of monomeric HER3-ECD I-II.
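
The single-ideal-species model that underlies a sedimentation equilibrium fit of this
kind can be sketched as follows. This is a generic textbook form, not the Beckman or Origin
implementation. The molecular weight, partial specific volume, rotor speeds, and
temperature are taken from the text; the buffer density, radial positions, and reference
absorbance are hypothetical values chosen for illustration.

```python
import math

GAS_CONSTANT = 8.314e7  # erg / (mol K), i.e. g cm^2 s^-2 mol^-1 K^-1


def equilibrium_signal(r_cm, r0_cm, c0, molar_mass, vbar, rho, rpm, temp_k):
    """Single ideal species: c(r) = c0 * exp(sigma * (r^2 - r0^2) / 2),

    where sigma = M * (1 - vbar * rho) * omega^2 / (R * T).
    """
    omega = rpm * 2.0 * math.pi / 60.0  # angular velocity in rad/s
    sigma = molar_mass * (1.0 - vbar * rho) * omega ** 2 / (GAS_CONSTANT * temp_k)
    return c0 * math.exp(sigma * (r_cm ** 2 - r0_cm ** 2) / 2.0)


MOLAR_MASS = 35000.0   # g/mol, estimate reported in the text
VBAR = 0.712           # cm3/g, partial specific volume from the text
TEMP_K = 277.15        # 4 degrees C
RHO = 1.005            # g/cm3, assumed buffer density (not given in the text)
R0_CM, C0 = 6.9, 0.35  # hypothetical reference radius and absorbance there

radii = [6.9, 7.0, 7.1]  # hypothetical radial positions (cm)
profile_12k = [equilibrium_signal(r, R0_CM, C0, MOLAR_MASS, VBAR, RHO, 12000, TEMP_K)
               for r in radii]
profile_15k = [equilibrium_signal(r, R0_CM, C0, MOLAR_MASS, VBAR, RHO, 15000, TEMP_K)
               for r in radii]
print([round(value, 3) for value in profile_12k])
print([round(value, 3) for value in profile_15k])
```

In a fit, the molecular weight is the adjustable parameter; fitting several loading
concentrations and speeds simultaneously, as in the global analysis described in Example 9,
tests whether a single value describes all scans.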

We used two independent methods to demonstrate direct binding of hrg to
HER3-ECD I-II: a pull-down assay and surface plasmon resonance (SPR; BIAcore)
analysis. Binding of hrg to HER3-ECD I-II was shown using a pull-down assay under
physiological salt conditions. In this analysis, we measured the amount of the V5-tagged
HER3-ECD I-II that dissociated from immobilized trx-hrg upon the addition of hrg. Using
the trx-hrg-coupled S-protein-agarose, HER3-ECD I-II could be eluted with 1 uM hrg
(Fig. 7A, lane 4). In contrast, neither PBS (Fig. 7A, lane 3) nor EGF (Fig. 7A, lane 5) was
effective in eluting HER3-ECD I-II.

To further confirm that HER3-ECD I-II binds hrg, we analyzed the interaction by
SPR (BIAcore). Trx-hrg fusion protein was immobilized on a BIAcore chip for these
measurements. SPR measurements using HER3-ECD I-II showed binding to the
immobilized trx-hrg with an equilibrium dissociation constant of 68 nM,
calculated directly from kon and koff (kon = (1.07 ± 0.1) × 10^5 M^-1 s^-1; koff = (7.27 ± 1) ×
10^-3 s^-1) (Fig. 7B). This interaction could be inhibited by stoichiometric concentrations of
hrg. Based on these results, we conclude that HER3-ECD I-II, containing domains I and II
of HER3, binds hrg.
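
The relationship between the reported rate constants and the 68 nM dissociation constant can be checked with a short calculation. The following is a minimal sketch, assuming the simple 1:1 binding relation Kd = koff/kon; the quadrature error propagation is an illustrative assumption (independent errors) and is not a calculation stated in this example.

```python
import math

# Rate constants as reported for HER3-ECD I-II binding to immobilized trx-hrg.
K_ON = 1.07e5       # association rate constant, M^-1 s^-1
K_ON_ERR = 0.1e5    # reported uncertainty in kon
K_OFF = 7.27e-3     # dissociation rate constant, s^-1
K_OFF_ERR = 1e-3    # reported uncertainty in koff


def dissociation_constant(k_on, k_off):
    """Return Kd = koff / kon in molar units (1:1 binding model)."""
    return k_off / k_on


def relative_error(k_on, k_on_err, k_off, k_off_err):
    """Combine relative errors in quadrature (assumes independent errors)."""
    return math.sqrt((k_on_err / k_on) ** 2 + (k_off_err / k_off) ** 2)


kd_nM = dissociation_constant(K_ON, K_OFF) * 1e9
kd_err_nM = kd_nM * relative_error(K_ON, K_ON_ERR, K_OFF, K_OFF_ERR)

print(f"Kd = {kd_nM:.0f} nM (propagated uncertainty about {kd_err_nM:.0f} nM)")
```

The computed value rounds to the 68 nM figure given above, which supports reading the association rate constant as being on the order of 10^5 M^-1 s^-1.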






Example 12: Identification, Generation and Characterization of HER3 Residues
Involved in Heregulin Binding

A. Generation of trx-hrg and HER3 Point Mutants
A thioredoxin-heregulin fusion (trx-hrg) was generated and purified as described
elsewhere (see, e.g. Landgraf et al. (1998) Biochemistry 37(9), 3220-8). The HER3 point
mutants were generated using the method of oligonucleotide-directed mutagenesis
described by Kunkel (see, e.g. Kunkel, T. A. (1988) Nucleic Acids Symp Ser (19), 43). Table
1 contains the oligonucleotides used for the mutagenesis. The PCR fragments were
subcloned into the Bgl II and EcoRI sites of HER3 I-II (residues 20-329) and HER3 I-IV
(residues 20-643) in the pMT/BiP/V5-His-A expression vector (Invitrogen, Carlsbad,
CA) and confirmed by sequencing. The mutants were expressed and purified as
described elsewhere (see, e.g. Singer et al. (2001) J Biol Chem 276(47), 44266-74).

B. Quantitation of the HER3 Point Mutants

The concentrations of the mutants were determined by analyzing the samples by
SDS-PAGE on a 4-15% gradient gel (Biorad) followed by Western blotting and
chemiluminescence using a monoclonal antibody directed against the V5 epitope
conjugated to horseradish peroxidase (Invitrogen). Each mutant was run in triplicate
and compared to a known concentration of HER3 I-II (see, e.g. Singer et al. (2001) J Biol
Chem 276(47), 44266-74).

C. Binding Assays, Mass Spectrometry and CD Analysis

The gel shift assay, "pull down" assay, and surface plasmon resonance
measurements were performed as described elsewhere (see, e.g. Singer et al. (2001) J Biol
Chem 276(47), 44266-74). Matrix-assisted laser desorption ionization (MALDI)
time-of-flight mass spectrometry was performed as described elsewhere (see, e.g. Singer et al.
(2001) J Biol Chem 276(47), 44266-74). The CD analysis was performed using a circular
dichroism instrument from AVIV (Lakewood, NJ) with a 0.1 mm pathlength. The scan
was performed from 260 nm to 195 nm every 1.0 nm with 1.50 nm between scans and
3.0 seconds to collect each point.

D. A mutation of residue 64 in HER3 to alanine abolishes heregulin binding
HER3 I-II E64A has a decreased affinity for heregulin binding. To locate the region
of domain I that is critical for hrg binding, we expressed a series of point mutants in
domain I of HER3 I-II and assayed for hrg binding by two independent methods: a pull
down assay and BIAcore analysis. To determine whether the mutants of HER3 I-II are
capable of binding hrg, we used a "pull down" assay in which we measured the HER3 I-II
that could be dissociated from immobilized thioredoxin-heregulin (trx-hrg) by the
addition of hrg. Using trx-hrg coupled S-protein agarose, HER3 I-II was eluted with hrg
and detergent (Figure 10A, lanes 3 and 5). No HER3 I-II E64A could be recovered by
elution with either hrg or detergent (Figure 10A, lanes 4 and 6). This provides evidence
that HER3 I-II E64A does not bind hrg.

To further confirm that HER3 I-II E64A does not bind hrg, we analyzed the
interaction by surface plasmon resonance (SPR, BIAcore). Trx-hrg fusion protein was
immobilized on a BIAcore chip for these measurements. We have previously shown
that HER3 I-II binds to immobilized trx-hrg with a calculated equilibrium
dissociation constant of 68 nM using SPR measurements (see, e.g. Singer et al. (2001) J
Biol Chem 276(47), 44266-74). HER3 I-II N44A had a Kd of 95 nM and HER3 I-II V110A had a
Kd of 15 nM (Table 2). HER3 I-II E64A had no detectable binding above 500 nM to the
immobilized hrg on the BIAcore chip (Figure 10B). All other mutants had a Kd similar
to wild type. These results provide evidence that residues 44, 64, and 110 are in the
heregulin binding region of HER3, and that residue 64 is critical for heregulin binding.

The residue at position 51 was also mutated to alanine because it could be
protected by hrg from proteolytic cleavage, but was not directly in the binding footprint
(Singer, 2001). HER3 I-II K51A had the same Kd as HER3 I-II (Table 2). This provides
evidence that residue 51 contributes no binding energy directly to HER3-heregulin
interactions.

To show that the decreased binding affinity of HER3 I-II E64A is not a result of
proteolysis or misfolding, we performed mass spectrometry and CD analysis of HER3 I-II
E64A. The molecular mass of HER3 I-II E64A is between 38.4 kDa and 39.8 kDa, which is
comparable to the molecular mass of HER3 I-II (38.5 to 40 kDa) (Figure 11A). This
provides evidence that the polypeptide chain of HER3 I-II E64A is intact. The CD spectra
of HER3 I-II E64A and HER3 I-II were essentially identical between 210 nm and 260 nm,
providing evidence that they have the same secondary structure (Figure 11B). The
fluctuations in the signal between 190 nm and 210 nm in HER3 I-II E64A are probably due
to aggregation. These results show that a mutation at residue 64 in HER3 I-II has little
effect on the structure of the protein, but results in decreased affinity for hrg.

To show that mutation at residue 64 has an effect in the extracellular domain of
HER3, we expressed a series of mutants including HER3 I-IV E64A and HER3 I-IV and
assayed for hrg binding by two independent methods: a native gel analysis and a "pull
down" assay. To determine the effects of cleavage of HER3 I-IV on its ability to bind
hrg, a gel mobility shift assay was performed. In a native gel analysis, HER3 I-IV shows
an electrophoretic mobility similar to HER3 I-IV E64A. HER3 I-IV shows a discrete shift in
the presence of a 1:1 molar ratio of hrg to HER3 I-IV. HER3 I-IV E64A does not shift its gel
position in the presence of hrg (Figure 12A). This finding provides evidence that
HER3 I-IV E64A does not bind hrg.

To confirm that HER3 I-IV E64A has decreased binding affinity for hrg, we used a
"pull down" assay in which we measured the HER3 I-IV and HER3 I-IV E64A that could be
dissociated from immobilized thioredoxin-heregulin (trx-hrg) by the addition of hrg.
Using trx-hrg coupled S-protein agarose, HER3 I-IV was eluted with hrg and detergent
(Figure 12B, lanes 6 and 8), but not with PBS (Figure 12B, lane 4). No HER3 I-IV E64A
could be recovered by elution with PBS, hrg or detergent (Figure 12B, lanes 5, 7 and 9).
This provided evidence that HER3 I-IV E64A does not bind hrg.

E. Amino Acids 64 and 110 are critical for heregulin binding

In this study, we have identified two amino acids in discontinuous segments in
domain I of HER3 which appear to be critical for heregulin binding. A mutation at
residue 64 seemed to abolish binding in both recombinant H3 I-IV E64A and
H3 I-II E64A. No detectable binding could be measured by gel shift, BIAcore, or pull down
assay. H3 I-II E64A seemed to be correctly folded as determined by CD analysis. These
experimental results provide evidence that the decrease in binding affinity observed with
this mutation is probably due to direct effects on the interaction between H3 I-II E64A and
heregulin rather than the consequences of misfolding or proteolysis of the mutant
protein. A mutation at residue 110 (H3 I-II V110A) caused a 4 fold increase in binding
affinity. A mutation at residue 44 (H3 I-II N44A) caused a 1.5 fold decrease in binding affinity.
A mutation at the protected proteolysis site (H3 I-II K51A) had no effect on binding
affinity. These results provide evidence that amino acids 64 and 110 are critical for
heregulin binding.

F. Residues 64 and 110 map to the Hormone Binding Footprint in the IGF-1R Structure

Residues 64 and 110 are in domain I of HER3 and map to residues H30 and
L81 in the hormone binding footprint in the IGF-1R (see, e.g. Garrett et al. (1998) Nature
394(6691), 395-9) (Figure 13). They are within 12 Å of each other although they lie on
different strands. They are within 22 Å of the protected proteolysis site (see, e.g. Singer et
al. (2001) J Biol Chem 276(47), 44266-74). Their close proximity to each other and the
proteolysis site provides evidence that both residues may be involved in ligand binding.

G. Residues 64 and 110 could form specific interactions with heregulin

The five N-terminal residues of heregulin are required for specific interaction
with HER3 (see, e.g. Jones et al. (1998) J Biol Chem 273(19), 11667-74). A valine lies at
position 180 and a histidine is at position 178. H3 V110 and hrg V180 could form
hydrophobic interactions. H3 E64 and hrg H178 could form a salt bridge (Figure 14).
Mutation of these residues to alanine could disrupt these specific interactions.

Residue 50 can be protected by hrg from proteolysis but still not be involved
directly with hrg binding. Our results provide evidence that residue 50 is not directly
involved in hrg binding because a mutation at this site had no effect on hrg binding
affinity. We propose that the N-terminus of hrg binds to residues 64 and 110 and that
the omega loop blocks the protected proteolysis site (residue 50) by sterically hindering
interaction with trypsin (Figure 15). This figure also shows how hrg in this orientation
could fit into the cleft formed by domains I, II, and III, as was proposed for the IGF-1R
and IR (see, e.g. Garrett et al. (1998) Nature 394(6691), 395-9 and Jorissen et al. (2000) Protein
Science 9, 310-324).

We have identified two residues in HER3 that appear to be involved in
hrg binding utilizing results from an alanine scanning mutagenesis of the related insulin
receptor (see, e.g. Williams et al. (1995) J Biol Chem 270(7), 3012-6) and information from
the hormone binding footprint of the known structure of the Insulin-like Growth Factor 1
Receptor (see, e.g. Garrett et al. (1998) Nature 394(6691), 395-9). A multiple sequence
alignment of these three receptors allowed us to identify residues which were conserved
among all three and were in or near the hormone binding footprint of the IGF-1R (see,
e.g. Singer et al. (2001) J Biol Chem 276(47), 44266-74). This allowed us to choose which
residues would have the highest probability of forming interactions with heregulin. We
identified two residues that were critical for heregulin binding. We also observed
complementary residues in heregulin that might form specific interactions with HER3.
However, while this analysis implies direct molecular interactions between HER3 and
heregulin, it is not direct proof. A high resolution structure of the HER3-heregulin
complex is required to confirm these interactions.

Tables

Table 1: Oligonucleotide sequences used for generating the point mutants.

Construct   N-terminal primer (residue 20)                       C-terminal primer (residue 329)
HER3 I-II   CTA CTC TCT AGA TCC GAG GGC AAC TCT (SEQ ID NO: 9)   TAC CGA TCT AGA TTT CGG ACA GAG ACC CCC (SEQ ID NO: 10)

HER3 Mutant   Plus Strand                                 Minus Strand
E43A          GCGATGCTGCGAACCAATACC (SEQ ID NO: 11)       GGTATTGGTTCGCAGCATCGC (SEQ ID NO: 12)
N44A          CGATGCTGAGGCCCAATACCAG (SEQ ID NO: 13)      CTGGTATTGGCGCTCAGCATCG (SEQ ID NO: 14)
K51A          GACACTGTACGCGCTCTACGAG (SEQ ID NO: 15)      CTCGTAGAGCCGGTACAGTGTC (SEQ ID NO: 16)
E64A          GGGAACCTTGCGATTGTGCTC (SEQ ID NO: 17)       GAGCACAATCGCAAGGTTCCC (SEQ ID NO: 18)
V66A          CTTGAGATTGCGCTCACGG (SEQ ID NO: 19)         CGTGAGCGCAATCTCAAGGT (SEQ ID NO: 20)
F94A          CCATGAATGAAGCCTCTACTCTAC (SEQ ID NO: 21)    GTAGAGTAGAGCGTTCATTCATGG (SEQ ID NO: 22)
V110A         GACCCAGGCCTACGATGGGAA (SEQ ID NO: 23)       CTTCCCATCGTAGGCCTGGGTC (SEQ ID NO: 24)
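
Mutagenic plus and minus strand oligonucleotides are normally reverse complements of one another, so a listing such as Table 1 can be checked mechanically. The following is a minimal sketch of such a check; the function names are hypothetical, and only the E43A and E64A pairs are included because they are exact reverse complements as listed. Running the same check on the remaining rows is one way to flag offsets or transcription differences.

```python
# Translation table mapping each DNA base to its Watson-Crick complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_exact_pair(plus, minus):
    """True when the minus strand is the exact reverse complement of the plus strand."""
    return reverse_complement(plus) == minus.upper()


# Two primer pairs copied from Table 1 (SEQ ID NOs: 11/12 and 17/18).
PRIMER_PAIRS = {
    "E43A": ("GCGATGCTGCGAACCAATACC", "GGTATTGGTTCGCAGCATCGC"),
    "E64A": ("GGGAACCTTGCGATTGTGCTC", "GAGCACAATCGCAAGGTTCCC"),
}

results = {name: is_exact_pair(plus, minus)
           for name, (plus, minus) in PRIMER_PAIRS.items()}

for name, ok in results.items():
    print(f"{name}: minus strand is exact reverse complement = {ok}")
```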






Table 2: Dissociation constants for the binding of heregulin to alanine mutants of
HER3. Recombinant HER3 alanine mutants were bound to immobilized hrg on the
BIAcore chip. HER3 I-II showed binding to immobilized trx-hrg with a calculated
equilibrium dissociation constant of 68 nM (see, e.g. Singer et al. (2001) J Biol Chem
276(47), 44266-74). The mutants are designated by the residue being mutated followed
by the number indicating the position in the HER3 sequence, followed by alanine. Thus
V110A is a mutant in which valine at position 110 has been mutated to alanine. ND >
500 nM indicates that no detectable binding was observed above 500 nM. These results
indicate that a mutation of valine at position 110 increased binding affinity 5 fold, while a
mutation of the glutamate at position 64 nearly abolished binding affinity.

H3 I-II Mutant   Kd (nM)
E43A             63.0 +/- 6.4
N44A             95.5 +/- 15.6
K51A             63.0 +/- 6.4
E64A             ND > 500
V66A             60.5 +/- 6.3
V110A            15.4 +/- 3.1
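
The fold changes in affinity discussed in the text can be derived from the Kd values in Table 2 and the 68 nM wild-type value. The following is a minimal sketch, assuming fold change is defined as the ratio of wild-type Kd to mutant Kd (a ratio above 1 means tighter binding); that definition and the names used are illustrative assumptions. E64A is omitted because no Kd could be determined.

```python
WILD_TYPE_KD_NM = 68.0  # HER3 I-II binding to immobilized trx-hrg

# Kd values (nM) from Table 2.
MUTANT_KD_NM = {
    "E43A": 63.0,
    "N44A": 95.5,
    "K51A": 63.0,
    "V66A": 60.5,
    "V110A": 15.4,
}


def affinity_fold_change(mutant_kd_nm, wild_type_kd_nm=WILD_TYPE_KD_NM):
    """Ratio of wild-type Kd to mutant Kd; values above 1 mean tighter binding."""
    return wild_type_kd_nm / mutant_kd_nm


fold_changes = {name: affinity_fold_change(kd) for name, kd in MUTANT_KD_NM.items()}

for name, fold in fold_changes.items():
    if fold >= 1:
        print(f"{name}: {fold:.1f}-fold higher affinity than wild type")
    else:
        print(f"{name}: {1 / fold:.1f}-fold lower affinity than wild type")
```

With these inputs the V110A ratio falls between the 4 fold and 5 fold figures quoted in the text, and the N44A ratio is close to the quoted 1.5 fold decrease.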






Table 3: Additional amino acid residues in HER3 associated with heregulin binding as
determined by homology, crystallographic and solvent accessibility criteria.

IGF-1R            HER3 amino acid residues (SEQ ID NO: 2)
5-14              30-46
27-29, 31-36      61-63, 65-70
52-61             88-96
77-79, 81-90      106-109, 111-120
107-115           143-151
133-141           169-176

Residues include those in HER3 that lie on the face of the HER3 domain that includes
the mutated residues characterized herein.

Table 4: Illustrative HER3, Heregulin and HER2 polynucleotide and polypeptide
sequences:

HER3 Polynucleotide Sequence

ATGAGGGCGAACGACGCTCTGCAGGTGCTGGGCTTGCTTTTCAGCCTGGCCCGGGGCTCCGAGGTGGGC 
AACTCTCAGGCAGTGTGTC CTGGG ACTCTGAATGG C CTGAGTGTG AC CGGCGATG CTGAGAAC C AATAC 
CAGACACTGTACAAGCTCTACGAGAGGTGTGAGGTGGTGATGGGGAACCTTGAGATTGTGCTCACGGGA 
CACAATGCCGACCTGTCCTTCCTGCAGTGGATTCGAGAAGTGACAGGCTATGTCCTCGTGGCCATGAAT 

10 GAATTCTCTACTCTACCATTGCCCAACCTCCGCGTGGTGCGAGGGACCCAGGTCTACGATGGGAAGTTT 
GCCATCTTCGTCATGTTGAACTATAACACCAACTCCAGCCACGCTCTGCGCCAGCTCCGCTTGACTCAG 
CTCACCGAGATTCTGTCAGGGGGTGTTTATATTGAGAAGAACGATAAGCTTTGTCACATGGACACAATT 
GACTGGAGGGACATCGTGAGGGACCGAGATGCTGAGATAGTGGTGAAGGACAATGGCAGAAGCTGTCCC 
C CC TGTC ATGAGGTTTG CAAGGGGCG AT GCTGGGGTC CTGG ATCAGAAGAC TGC C AG AC ATTGACC AAG 

15 ACCATCTGTGCTCCTCAGTGTAATGGTCACTGCTTTGGGCCCAACCCCAACCAGTGCTGCCATGATGAG 
TGTGCCGGGGGCTGCTCAGGCCCTO\GGACACAGACTGCTTTGCCTGCCGGCACTTCAATGACAGTGGA 
GCGTGTGTACCTCGCTGTCCACAGCCTCTTGTCTACAACAAGCTAACTTTCCAGCTGGAACCCAATCCC 
C AC ACCAAGTATC AGT ATGG AGGAGTTTGTGTAGC C AGCTGTC CCC ATAACTTTGTGGTGGATC AAACA 
TCCTGTGTCAGGGCCTGTCCTCCTGACAAGATGGAAGTAGATAAAAATGGGCTCAAGATGTGTGAGCCT 

20 TGTGGGGGACTATGTCCCAAAGCCTGTGAGGGAACAGGCTCTGGGAGCCGCTTCCAGACTGTGGACTCG 
AG C AACATTGATGGATTTGTGAACTGCACC AAGATC CTGGGC AAC CTGGACTTTCTGATCAC CGGCCTC 
AATGGAGACCCCTGG CACAAGATCC CTGCCC TGGACCCAGAG AAGCTC AATGTCTTCCGGAC AGT ACG G 
GAGATC^CAGGTTACCTGAACATCCAGTCCTGGCCGCCCCACATGCACAACTTCAGTGTTTTTTCCAAT 
TTGACAACCATTGGAGGCAGAAGCCTCTACAACCGGGGCTTCTCATTGTTGATCATGAAGAACTTGAAT 

25 GTCACATCTCTGGGCTTCCGATCCCTGAAGGAAATTAGTGCTGGGCGTATCTATATAAGTGCCAATAGG 
CAGCTCTGCTACCACCACTCTTTGAACTGGACCAAGGTGCTTCGGGGGCCTACGGAAGAGCGACTAGAC 
ATCAAGCATAATCGGCCGCGCAGAGACTGCGTGGCAGAGGGCAAAGTGTGTGACCCACTGTGCTCCTCT 
GGGGGATGCTGGGGCCCAGGCCCTGGTCAGTGCTTGTCCTGTCGAAATTATAGCCGAGGAGGTGTCTGT 
GTGACCCACTGCAACTTTCTGAATGGGGAGCCTCGAGAATTTGCCCATGAGGCCGAATGCTTCTCCTGC 

30 CACCCGGAATGCCAACCCATGGGGGGCACTGCCACATGCAATGGCTCGGGCTCTGATACTTGTGCTCAA 
TGTGCCCATTTTCGAGATGGGCCCCACTGTGTGAGCAGCTGCCCCCATGGAGTCCTAGGTGCCAAGGGC 
CCAATCTACAAGTACCCAGATGTTCAGAATGAATGTCGGCCCTGCCATC 

AAAGGACCAGAGCTTCAAGACTGTTTAGGACAAACACTGGTGCTGATCGGCAAAACCCATCTGACAATG 
G CTTTG AC AGTGATAGC AGGATTGGTAGTGATTTT CATG ATGCTGGGCGG CACTTTTCTCTACTGG CGT 

35 GGGCGCCGGATTCAGAATAAAAGGGCTATGAGGCGATACTTGGAACGGGGTGAGAGCATAGAGCCTCTG 
GACCCCAGTGAGAAGGCTAACAAAGTCTTGGCCAGAATCTTCAAAGAGACAGAGCTAAGGAAGCTTAAA 
GTGCTTGGCTCGGGTGTCTTTGGAACTGTGCACAAAGGAGTGTGGATCCCTGAGGGTGAATCAATCAAG 
ATTCCAGTCTGCATTAAAGTCATTGAGGACAAGAGTGGACGGCAGAGTTTTCAAGCTGTGACAGATCAT 
ATGCTGGCCATTGGCAGCCTGGACCATGCCCACATTGTAAGGCTGCTGGGACTATGCCCAGGGTCATCT 

40 CTGCAGCTTGTCACTCAATATTTGCCTCTGGGTTCTCTGCTGGATCATGTGAGACAACACCGGGGGGCA 
CTGGGGCCACAGCTGCTGCTCAACTGGGGAGTACAAATTGCCAAGGGAATGTACTACCTTGAGGAACAT 
GGTATGGTGCATAGAAACCTGGCTGCCCGAAACGTGCTACTCAAGTCACCCAGTCAGGTTCAGGTGGCA 
GATTTTGGTGTG GCTGACCTGCTGCCTC CTG ATG ATAAG C AGC TGCTATAC AGTGAGG CCAAGACTC C A 
ATTAAGTGGATGGCCCTTGAGAGTATCCACTTTGGGAAATACAC^CACCAGAGTGATGTCTGGAGCTAT 

45 GGTGTG ACAGTTTGG G AGTTG ATGACCTTCGGGGCAGAG CCCTATG CAGGG CTACG ATTGGCTGAAGTA 
CCAGACCTGCTAGAGAAGGGGGAGCGGTTGGCACAGCCCCAGATCTGCACAATTGATGTCTACATGGTG 
ATGGTCAAGTGTTGGATGATTGATGAGAACATTCGCCCAACCTTTAAAGAACTAGCCAATGAGTTCACC 
AGGATGGCCCGAGACCCACCACGGTATCTGGTCATAAAGAGAGAGAGTGGGCCTGGAATAGCCCCTGGG 
CCAGAGCCCCATGGTCTGACAAACAAGAAGCTAGAGGAAGTAGAGCTGGAGCCAGAACTAGACCTAGAC 

50 CTAGACTTGGAAGCAGAGGAGGACAACCTGGCAACCACCACACTGGGCTCCGCCCTCAGCCTACCAGTT 
GGAACACTTAATCGGCCACGTGGGAGCCAGAGCCTTTTAAGTCCATCATCTGGATACATGCCCATGAAC 
CAGGGTAATCTTGGGGGGTCTTGCCAGGAGTCTGCAGTTTCTGGGAGCAGTGAACGGTGCCCCCGTCCA 
GTCTCTCTACACC C AATGCCACGGGG ATG C CTGG C ATC AG AGTCATC AGAGGGGCATGTAAC AGG CTCT 
GAGGCTGAGCTCCAGGAGAAAGTGTCAATGTGTAGAAGCCGGAGCAGGAGCCGGAGCCCACGGCCACGC
GGAGATAGCGCCTACCATTCCCAGCGCCACAGTCTGCTGACTCCTGTTACCCCACTCTCCCCACCCGGG 
TTAGAGG AAGAGGATGT C AACGGTTATGTC ATGCCAG ATAC AC AC CTCAAAGGT ACTCCCTC CTCCCGG 
GAAGGCACCCTTTCTTCAGTGGGTCTCAGTTCTGTCCTGGGTACTGAAGAAGAAGATGAAGATGAGGAG 
5 TATGAATACATGAACCGGAGGAGAAGGCACAGTCCACCTCATCCCCCTAGGCCAAGTTCCCTTGAGGAG 
CTGGGTTATGAGTAC ATGGATGTGGGGTC AGAC C TC AGTG CCTCTCTGGG C AGC AC ACAGAGTTGC CC A 
CTC C ACCCTGT AC CC ATC ATG CC CACTG CAGGC ACAACTCC AG ATGAAG ACTATG AATATATGAATCGG 
CAACGAGATGG AGGTGGTCCTGGGGGTGATTATG CAGCC ATG GG GGC CTG CC C AG CATCTGAGC AAGGG 
TATGAAGAG ATG AG AGC TTTT CAGGGGC CTGGAC ATC AGGC CCC C CATGTC CATTATGCCCGCCTAAAA 
10 ACTCTACGTAGCTTAGAGGCTACAGACTCTGCCTTTGATAACCCTGATTACTGGCATAGCAGGCTTTTC 
CCC AAGGCTAATG CC C AGAGAACG SEQ ID NO: 1 



15 HER 3 Polypeptide Sequence 

mrandalqvlgllfs larg s evgnsqavcpgtlnglsvtgdaenqyqtlykl ye 

hnadlsflqwirevtgyvlvamnefstlplpnlrwrgtqvydgkfaif 

lteilsggvyiekndklchmdtidtoditodrdaeiwkdngrscppchevckgrcwgpgsedcq 

20 ticapqcnghcfgpnpnqcchdecaggcsgpqdtdcfacrhfndsgacvprcpqplvynkltfqlepnp 
htkyqyggvcvascphnfvvdqtscvracppdkmevdknglkmcepcgglcpkacegtgsgsrfqtvds 
snidgfvnctkilgnldflitglngdpwhkipaldpeklnvfrttoeitc^ 
lttiggrslynrgfsllimknlnvtslgfrslkeisagriyisanrqlcyhhslnwtkvlrgpteerld 
ikhnrprrdctaegkvcdplcssggcwgpgpgqclscritc^ 

25 hpecqpmggtatcngsgsdtc^qcahfrdgphcvsscphgvlgakgpiykypdvqnecrpchenctqgc 

KGPELQDCLGQTLVTjIGKTHIjTMALTVI AGLWI FMMLGGTFLYWRGRRIQNKRAMRRYLERGES IEPL 
DPSEKANKVLARIFKETELRKLKVLGSGWGTVHKGWIPEGESIKIPVCIKVIEDKSGRQSFQAVTDH 
MLAIGSLDHAHITOLLGLCPGSSLQLVTQYLPLGSLLDHTOQHRGALGPQLLLIiWGVQIAKGMYYLEE 
GMVHRNLAARNVLLKSPSQVQVADFGVADLLPPDDKQLLYSEAKOT^ 

30 GVTVWELMTFGAEPYAGLRLAEVPDLLEKGERLAQPQICTIDVYMVMVKCWMIDENIRPTFKELA^ 

RMARD P PR YL VI KRE SGPG I APGPE PHGLTNKKLEE VELE PELDLDLDL EAE EDNLATTTLGS AL S LP V 
GTLNRPRGSQSLLSPSSGYMPMNQGNLGGSCQESAVSGSSERCPRPVSLHPMPRGCLASESSEGHVTGS 
EAELQEKVSMCRSRSRSRSPRPRGDSAYHSQRHSLLTPVTPLSPPGLEEEDVNGYVMPDTHLKGTPSSR 
EGTLSSVGLSSVLGTEEEDEDEEYEYMNRRRRHSPPHPPRPSSLEELGYEYMDVGSDLSASLGSTQSCP 

35 LHPVPIMPTAGTTPDEDYEYMNRQRDGGGPGGDYAAMGACPASEQGYEEMRAFQGPGHQAPHVHYARLK 
TLRSLEATDSAFDNPDYWHSRLFPKANAQRT SEQ ID NO: 2 
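
The residue numbering used for the point mutants (E43, N44, K51, E64, V66, F94 and V110) can be cross-checked against the start of the HER3 polynucleotide sequence of SEQ ID NO: 1. The following is a minimal sketch that translates the first 345 nucleotides of that sequence with the standard genetic code and looks up those positions; the variable and function names are hypothetical, and the nucleotide string is the opening of SEQ ID NO: 1 with spacing removed.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# First 345 nucleotides (codons 1-115) of the HER3 coding sequence, SEQ ID NO: 1.
HER3_5PRIME = (
    "ATGAGGGCGAACGACGCTCTGCAGGTGCTGGGCTTGCTTTTCAGCCTGGCCCGGGGCTCCGAGGTGGGC"
    "AACTCTCAGGCAGTGTGTCCTGGGACTCTGAATGGCCTGAGTGTGACCGGCGATGCTGAGAACCAATAC"
    "CAGACACTGTACAAGCTCTACGAGAGGTGTGAGGTGGTGATGGGGAACCTTGAGATTGTGCTCACGGGA"
    "CACAATGCCGACCTGTCCTTCCTGCAGTGGATTCGAGAAGTGACAGGCTATGTCCTCGTGGCCATGAAT"
    "GAATTCTCTACTCTACCATTGCCCAACCTCCGCGTGGTGCGAGGGACCCAGGTCTACGATGGGAAGTTT"
)


def translate(dna):
    """Translate a DNA coding sequence codon by codon (no stop handling)."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


protein = translate(HER3_5PRIME)

# Positions are 1-based, matching the numbering used for the mutants.
MUTATED_POSITIONS = [43, 44, 51, 64, 66, 94, 110]
observed = {pos: protein[pos - 1] for pos in MUTATED_POSITIONS}

for pos, residue in observed.items():
    print(f"position {pos}: {residue}")
```

Under these assumptions the wild-type residues at the listed positions agree with the mutant names in Tables 1 and 2, including asparagine at position 44.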



Heregulin Polynucleotide Sequence 

40 

ATGTCCGAGCGCAAAGAAGGCAGAGGCAAAGGGAAGGGCAAGAAGAAGGAGCGAGGCTCCGGCAAGAAG 
CCGGAGTCCGCGGCGGGCAGCCAGAGCCCAGCCTTGCCTCCCCAATTGAAAGAGATGAAAAGCCAGGAA 
TCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCAGATTCAAG 
TGGTTCAAGAATGGGAATGAATTGAATCGAAAAAACAAACCACAAAATATCAAGATAC^ 
45 GGGAAGTCAGAACTTCGCATTAACAAAGCATCACTGGCTGATTCTGGAGAGTATATGTGCAAAGTGATC 
AGCAAATTAGGAAATGACAGTGCCTCTGCCAATATCACC^^ 

ATGCCAGCCTCAACTGAAGGAGCATATGTGTCTTCAGAGTCTCCCATTAGAATATCAGTATCCACAGAA 
GGAGC^UU^TACTTCTTCATCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAG 
AAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGACCTTTCAAACCCCTCGAGA 
50 TACTTGTGCAAGTGCCC AAATGAGTTTACTGGTGAT CG CTGC C AAAACTACGTAATGGCC AGCTTC TAC 
AAGCATCTTGGGATTGAATTTATGGAGGCGGAGGAGCTGTACCAGAAGAGAGTGCTGACCATAACCGGC 
ATCTGCATCGCCCTCCTTGTGGTCGGCATCATGTGTGTGGTGGCCTACTGCAAAACCAAGAAACAGCGG 
AAAAAGCTGCATGACCGTCTTCGGCAGAGCCTTCGGTCTGAACGAAACAATATGATGAACATTGCCAA^
GGG CCTCAC C ATCCTAAC CC ACC CC CCGAGAATGTCCAGCTGGTGAATCAAT ACGTATC TAAAAACGTC 
ATCTC CAGTG AG C ATATTGTTG AG AGAGAAGC AG AG AC ATC CTTTTCCACC AGTC ACT AT ACTTC CAC A 
GCCCATCACTCCACTACTGTCACCCAGACTCCTAGCCACAGCTGGAGCAACGGACACACTGAAAGCATC 
5 CTTTCCGAAAGCCACTCTGTAATCGTGATGTCATCCGTAGAAAACAGTAGGCACAGCAGCCCAACTGGG 
GG C C C AAG AGG ACGTCT TAATGG CAC AGGAGG CCCT CGTGAATGTAACAGCTT CCTC AGGCATGC C AGA 
GAAAC C CCTG ATTCCT ACCGAG ACTCTC CTCATAGTGAAAGGTATGTGTC AGC CATGACC ACCC CGG C T 
CGTATGTCACCTGTAGATTTCCACACGCCAAGCTCCCCCAAATCGCCCCCTTCGGAAATGTCTCCACCC 
GTGTCC AG C ATGACGGTGTC C ATGCCTTCC ATGG CGGT CAG CC C CTTCATGG AAGAAGAGAGAC CT CTA 

10 CTTCTCGTGACACCACCAAGGCTGCGGGAGAAGAAGTTTGACCATCACCCTCAGCAGTTCAGCTCCTTC 
CACCACAACCCCGCGCATGACAGTAACAGCCTCCCTGCTAGCCCCTTGAGGATAGTGGAGGATGAGGAG 
TATGAAACGACCCAAGAGTACGAGCCAGCCCAAGAGCCTGTTAAGAAACTCGCCAATAGCCGGCGGGCC 
AAAAGAAC C AAGCCC AATGGC C AC ATTGCTAACAG ATTGGAAGTGGAC AGCAAC AC AAG CTCCC AGAGC 
AGTAACTCAGAGAGTGAAACAGAAGATGAAAGAGTAGGTGAAGATACGCCTTTCCTGGGCATACAGAAC 

15 CCCCTGGCAGCCAGTCTTGAGGCAACACCTGCCTTCCGCCTGGCTGACAGCAGGACTAACCCAGCAGGC 
CG CTTCTCGAC AC AGGAAG AAATCCAGGC CAGGCTGTC TAG TGTAATTGCTAAC CAAGACCCTATTGCT 
GTA SEQ ID NO : 3 



20 Heregulin Polypeptide Sequence 

MSERKEGRGKGKGKKKERGSGKKPESAAGSQSPALPPRLKEMKSQESAAGSKLVLRCETSSEYSSLRF 
WFKNGNELNRKNKPQNIKIQKK3>GKSELRINKASLADSGEYMCK^ 

MPASTEGAYVSSESPIRISVSTEGANTSSSTSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKDLSNPSR 
25 YLCKCPNEFTGDRCQNYVMASFYKHLGIEFM^ 

KKLHDRLRQSLRSERNNMMNIANGPHHPNPPPENVQLVNQW 

AHHSTTVTQTPSHSWSNGHTESILSESHSVIVMSSVENSRHSSPTGGPRGRLNGTGGPRECNSFLRHAR 
ETPD S YRDS PHS ER YVS AMTT P ARMS P VD,FHTP SSPKSPPSEMSP P VS S MTVS MPS MAVS P FMEE ERPL 
LL VTP PRLREKKFDHHPQQ F S S FHHNPAHDSNSLP AS PLR I VEDEE YETTQE YE PAQEP VKKLANS RRA 
30 KRTKPNGHIANRLEVDSNTSSQSSNSESETEDERVGEDTPFLiGIQNPLAASLEATPAFRLADSRTNPA 
RFSTQEEIQARLSSVIANQDPIAV SEQ ID NO: 4 



HER2 Polynucleotide Sequence 

35 

ATGGAGCTGGCGGCCTTGTGCCGCTGGGGGCTCCTCCTCGCCCTCTTGCCCCCCGGAGCCGCGAGCACC 
CAAGTGTGCACCGGCACAGACATGAAGCTGCGGCTCCCTGCCAGTCCCGAGACCCACCTGGACATGCTC 
CGCCACCTCTACCAGGGCTGCCAGGTGGTGCAGGGAAACCTGGAACTCACCTACCTGCCCACCAATGCC 

40 AGCCTGTCCTTCCTGCAGGATATCCAGGAGGTGCAGGGCTACGTGCTCATCGCTCACAACCAAGTGAGG 
CAGGTCCCACTGCAGAGGCTGCGGATTGTGCGAGGCACCCAGCTCTTTGAGGACAACTATGCCCTGGCC 
GTGCTAGACAATGGAGACCCGCTGAACAATACCACCCCTGTCACAGGGGCCTCCCCAGGAGGCCTGCGG 
GAGCTGC^GCTTCGAAGCCTCACAGAGATCTTGAAAGGAGGGGTCTTGATCCAGCGGAACCCCCAGCTC 
TGCTACCAGGACACGATTTTGTGGAAGGACATCTTCCACAAGAACAACCAGCTGGCTCTCACACTGATA 

45 GACACCAACCGCTCTCGGGCCTGCCACCCCTGTTCTCCGATGTGTAAGGGCTCCCGCTGCTGGGGAGAG 
AGTTCTGAGGATTGTCAGAGCCTGACGCGCACTGTCTGTGCCGGTGGCTGTGCCCGCTGCAAGGGGCCA 
CTGCCCACTGACTGCTGCCATGAGCAGTGTGCTGCCGGCTGCACGGGCCCCAAGCACTCTGACTGCCTG 
GCCTGCCTCCACTTCAACC^C^GTGGCATCTGTGAGCTGC^CTGCCCAGCCCTGGTCACCrrAC^CACA 
GACACGTTTGAGTCCATGCCCAATCCCGAGGGCCGGTATACATTCGGCGCCAGCTGTGTGACTGCCTGT 

50 CCCTACAACTACCTTTCTACGGACGTGGGATCCTGCACCCTCGTCTGCCCCCTGCACAACCAAGAGGTG 
AC AGCAGAGG ATGGAACAC AGCGGTGTG AGAAGTGCAG CAAGC CCTGTGC C CGAGTGTGCTATGGTCTG 
GGCATGGAGCACTTGCGAGAGGTGAGGGCAGTTACCAGTGCCAATATCCAGGAGTTTGCTGGCTGCAAG 
AAGATCTTTGGGAGCCTGGCATTTCTGCCGGAGAGCTTTGATGGGGACCCAGCCTCCAACACTGCCCCG 
CTC CAG CC AG AG CAG CTCCAAGTGTTTG AG ACTCTGGAAGAGATCAC AGGTTAC CT ATA.C AT CTC AGC A
TGGCCGGACAGC CTG C CTG ACCT C AG CGTCTTCCAGAACCTGC AAGTAATCCGGG G ACGAATTCTGCAC 
AATGGCGCCTACTCGCTGACCCTGCAAGGGCTGGGCATCAGCTGGCTGGGGCTGCGCTCACTGAGGGAA 
CTGGGCAGTGGACTGGCCCTCATCCACCATAACACCCACCTCTGCTTCGTGCACACGGTGCCCTGGGAC 
5 CAGCTCTTTCGGAACCCGCACCAAGCTCTGCTCCACACTGCCAACCGGCCAGAGGACGAGTGTGTGGGC 
GAGGGCCTGGCCTGCCACCAGCTGTGCGCCCGAGGGCACTGCTGGGGTCCAGGGCCCACCCAGTGTGTC 
AACTGCAGC C AGTTCCTTCGGGGC CAGGAGTGCGTGGAGGAATGCCGAGTAC TG C AGGGGCTCCC C AGG 
G AGTATGTGAATGC C AGGC ACTGTTTG CCGTGCCAC CCTGAGTGTCAG CCC C AG AATGG CTCAGTG AC C 
TGTTTTGGACCGG AGGCTG ACC AGTGTGTGG CCTGTGC CC ACTAT AAG GACCCTC CCTT CTG CGTGG C C 

10 CGCTG CCC C AG CGGTGTGAAACCTGAC CTCTCCTACATGCC C ATCTGGAAGTTT CC AGATG AGGAGGGC 
GCATGCCAGCCTTGCCCCATCAACTGCACCCACTCCTGTGTGGACCTGGATGACAAGGGCTGCCCCGCC 
GAGCAGAGAGCCAGCCCTCTGACGTCCATCGTCTCTGCGGTGGTTGGCATTCTGCTGGTCGTGGTCTTG 
GGGGTGGTCTTTGGGATCCTCATCAAGCGACGGCAGCAGAAGATCCGGAAGTACACGATGCGGAGACTG 
CTGCAGGAAACGGAGCTGGTGGAGCCGCTGACACCTAGCGGAGCGATGCCCAACCAGGCGCAGATGCGG 

15 ATCCTGAAAGAGACGGAGCTGAGGAAGGTGAAGGTGCTTGGATCTGGCGCTTTTGGCACAGTCTACAAG 
GGCATCTGGATCCCTGATGGGGAGAATGTGAAAATTCCAGTGGCCATCAAAGTGTTGAGGGAAAACACA 
TCC CCC AAAGC C AACAAAGAAATCTTAGACG AAGCATACGTG ATGGCTGGTGTGGGCTC CCCATATGT C 
TCCCGCCTTCTGGGCATCTGCCTGACATCCACGGTGCAGCTGGTGACACAGCTTATGCCCTATGGCTGC 
CTCTTAGACCATGTCCGGGAAAACCGCGGACGCCTGGGCTCCCAGGACCTGCTGAACTGGTGTATGCAG 

20 ATTGCCAAGGGGATGAGCTACCTGGAGGATGTGCGGCTCGTACACAGGGACTTGGCCGCTCGGAACGTG 
CTGGTCAAGAGTCCCAACCATGTCAAAATTACAGACTTCGGGCTGGCTCGGCTGCTGGACATTGACGAG 
ACAGAGTACCATGCAGATGGGGGCAAGGTGCCCATCAAGTGGATGGCGCTGGAGTCCATTCTCCGCCGG 
CGGTTCACCCACCAGAGTGATGTGTGGAGTTATGGTGTGACTGTGTGGGAGCTGATGACTTTTGGGGCC 
AAAC CTTACGATG GG ATCC C AG CCCGGGAGATC CC TG AC C TG C TGG AAAAGGGGG AG CGGCTGCC C CAG 

25 CCC C C C ATCTGCACCATTG ATGTCTAC ATGATC ATGGTCAAATGTTGG ATGATTG ACTC TGAATGTCGG 
CCAAGATTCCGGGAGTTGGTGTCTGAATTCTCCCGCATGGCCAGGGACCCCCAGCGCTTTGTGGTCATC 
CAG AATGAG G AC TTGGG CC C AG C CAGTC CCTTGGAC AGC AC CTTCTAC CGCTC ACTG CTGGAGGACG AT 
GACATGGGGGACCTGGTGGATGCTGAGGAGTATCTGGTACCCCAGCAGGGCTTCTTCTGTCCAGACCCT 
GCCCCGGGCGCTGGGGGCATGGTCCACCACAGGCACCGCAGCTCATCTACCAGGAGTGGCGGTGGGGAC 

30 CTGACACTAGGGCTGGAGCCCTCTGAAGAGGAGGCCCCCAGGTCTCCACTGGCACCCTCCGAAGGGGCT 
GGCTCCGATGTATTTGATGGTGACCTGGGAATGGGGGCAGCCAAGGGGCTGCAAAGCCTCCCCACACAT 
GACCCCAGCCCTCTACAGCGGTACAGTGAGGACCCCACAGTACCCCTGCCCTCTGAGACTGATGGCTAC 
GTTGCCCCCCTGACCTGCAGCCCCCAGCCTGAATATGTGAACCAGCCAGATGTTCGGCCCCAGCCCCCT 
TCGCCCCGAGAGGGCCCTCTGCCTGCTGCCCGACCTGCTGGTGCCACTCTGGAAAGGGCCAAGACTCTC 

35 TCCCCAGGGAAGAATGGGGTCGTCAAAGACGTTTTTGCCTTTGGGGGTGCCGTGGAGAA.CCCCGAGTAC 
TTGACACCCGAGGGAGGAGCTGCCCCTCAGCCCCACCCTCCTCCTGCCTTCAGCCCAGCCTTCGACAAC 
CTCTATTACTGGGACCAGGACCCACCAGAGCGGGGGGCTCCACCCAGCACCTTCAAAGGGACACCTACG 
GCAGAGAACCCAGAGTACCTGGGTCTGGACGTGCCAGTG SEQ ID NO : 5 



HER2 Polypeptide Sequence 
MELAALCRWGLLLALLPPGAASTQVCTGTDMKL^ 

S LS FLQDI QEVQGYVL I AHNQVRQVPLQRLR ITOGTQLFEDNYALAVLDNGDPLNNTTP VTGAS PGGLR 
45 ELQLRSLTEILKGGVLIQRNPQLCYQDTILWKDIFHKNNQLALTLIDTNRSRACHPCSPMCKGSRCWGE 
SSEDCQSLTRTVCAGGCARCKGPLPTDCCHEQC^GCTGPKHSDCLACLHFNHSGICELjHCPALVTYNT 
DTFESMPNPEGRYTFGASCVTACPYNYLSTDVGSCTIjVCPLHNQEVTAEDGTQRCEKCSKPCARVCYGL 

gmehlrevravtsaniqefagckkifgslaflpesfdgdpasntaplqpeqlqvfetleeitgylyisa 
wpdslpdlsvfqnlqvirgrilhngaysltlqglgiswlglrslrelgsglalihhnthlcfvintvpwd 

50 QLFRNPHQALLHTANRPEDECVGEGLACHQLCARGHCWGPGPTQCVXTCSQFLRGQECVEECRVLQGIjPR 

eywarhclpchpecqpqngsvtcfgpeadqctac^hykdppfc^ 

acqpcpincthsc^lddkgcpaeqraspltsivsawgillvvvlgvwgilikrrqqkirkytmrr^ 
lqetelvepltpsgampnqaqmrilketelrkvkvlgsgafgtwk^ 

S PKANKEILDEAYVMAGVGS P WSRLLG ICLTSTVQLVTQIjMP ygclldhvrenrgrlg sqdllnwcmq 
I AKGMS YLEDVRL VHRDLAARNVLVKS PNHVKI TDFGLARLLD IDETE YHADGGKVP I KWMALES I LRR
RFTHQSDVWSYGVTVWELMTFGAKPYDGIPAREIPDLLEKGERLPQPPICTIDVYMIMVKCWMIDSECR 
PRFRELVSEFSRMARDPQRFWIQNEDLGPASPLDSTFYRSLLEDDDMGDLVDAEEYLVPQQGFFCPDP 
APGAGGMVHHRHRSSSTRSGGGDLTLGLEPSEEEAPRSPLAPSEGAGSDVFDGDLGMGAAKGLQSLPTH 
5 DPSPLQRYSEDPTVPLPSETDGYVAPLTCSPQPEYVNQPDVRPQPPSPREGPLPAARPAGATLERAKTL 
SPGKNGWKDVFAFGGAVENPEYLTPQGGAAPQPHPPPAFSPAFDNLYYWDQDPPERGAPPSTFKGTPT 
AENP E YLGLD VP V SEQ ID NO: 6 

WHAT IS CLAIMED IS:

1. A non-naturally occurring HER3 variant polypeptide consisting
essentially of amino acid 20 to amino acid 329 of HER3 polypeptide sequence of SEQ
ID NO: 2; wherein the HER3 variant polypeptide:

(a) specifically binds to the heregulin polypeptide of SEQ ID NO: 4; and

(b) exhibits an impaired ability to interact with the HER2 polypeptide of SEQ ID
NO: 6.

2. A non-naturally occurring HER3 variant polypeptide consisting
essentially of amino acid 19 to amino acid 329 of HER3 polypeptide sequence of SEQ
ID NO: 2; wherein the HER3 variant polypeptide has the ability to inhibit the interaction
between wild type HER3 having the polypeptide sequence of SEQ ID NO: 2 and
heregulin having the polypeptide sequence of SEQ ID NO: 4.

3. The HER3 variant polypeptide of claim 1 further comprising one or
more amino acid substitutions at the following residue position(s) in SEQ ID NO: 2:
E43; N44; K51; E64; V66; and V110.

4. A HER3 variant polypeptide comprising an amino acid sequence which
differs from the native sequence HER3 polypeptide sequence of SEQ ID NO: 2 and has
one or more amino acid substitutions at the following residue position(s) in SEQ ID
NO: 2: E43; N44; K51; E64; V66; and V110.

5. An isolated nucleic acid comprising a nucleotide sequence encoding the 
HER3 variant of claim 4. 

6. A vector comprising the nucleic acid of claim 5. 

7. A host cell comprising the vector of claim 6. 

8. The host cell of claim 7 wherein said host cell is E. coli.

9. A method of making HER3 variant polypeptide, comprising the steps of:
(a) providing a host cell comprising the vector of claim 6; (b) providing culture media; (c)
culturing the host cell in the culture media under conditions sufficient to express the
HER3 variant polypeptide; (d) recovering the HER3 variant polypeptide from the host
cell or culture media; and (e) purifying the HER3 variant polypeptide.

10. The HER3 variant polypeptide of claim 4, wherein the HER3 variant
polypeptide is conjugated or linked to one or more polyol groups.

11. The HER3 variant polypeptide of claim 4, wherein the HER3 variant
polypeptide has a substitution at V110.

12. The HER3 variant polypeptide of claim 11, wherein the HER3 variant
polypeptide has an alanine substitution at V110.

13. The HER3 variant polypeptide of claim 4, wherein the HER3 variant
polypeptide has a substitution at E64.

14. The HER3 variant polypeptide of claim 4, wherein the HER3 variant 
polypeptide comprises a soluble, extracellular domain HER3 polypeptide. 

15. The isolated HER3 variant polypeptide of claim 1, wherein the HER3
variant polypeptide does not have any portion of the amino acid sequence consisting of
amino acid 330 to amino acid 1342 of HER3 polypeptide sequence of SEQ ID NO: 2.

16. A method of identifying a compound which specifically binds the
heregulin binding domain in a HER3 variant polypeptide comprising amino acid 20 to
amino acid 329 of HER3 polypeptide sequence of SEQ ID NO: 2 comprising contacting
the HER3 variant polypeptide with a test compound under conditions favorable to
binding and determining whether the test compound specifically binds to the HER3
variant polypeptide such that a compound which binds to the HER3 variant polypeptide
can be identified.

17. The method of Claim 16, further comprising determining whether the
test compound inhibits the heregulin induced tyrosine kinase activity associated with a
HER3 polypeptide having the sequence shown in SEQ ID NO: 2 comprising the steps
of:

contacting mammalian cells that express HER3 receptor with heregulin in the
presence and absence of the test compound; and

monitoring the mammalian cells for the tyrosine kinase activity associated with
the HER3 polypeptide, wherein an inhibition in tyrosine kinase activity in the presence
of the test compound as compared to the absence of the test compound identifies the
test compound as an inhibitor of heregulin induced tyrosine kinase activity.
5 18. The method of Claim 16, further comprising determining whether the 

test compound enhances the heiegulin induced tyrosine kinase activity associated with a 
HER3 polypeptide having the sequence shown in SEQ ID NO: 2 comprising the steps 
of: 

contacting mammalian cells that express HER3 receptor with heregulin in the 
10 presence and absence of the test compound; and 

monitoring the mammalian cells for the tyrosine kinase activity of the HER3 
polypeptide, wherein an increase in tyrosine kinase activity in the presence of the test 
compound as compared to the absence of the test compound identifies the test 
compound as an enhancer of heregulin induced tyrosine kinase activity. 
19. A method of determining whether a test compound modulates the
interaction between heregulin polypeptide of SEQ ID NO: 4 and a HER3 variant
polypeptide comprising amino acid 20 to amino acid 329 of HER3 polypeptide sequence
of SEQ ID NO: 2 comprising the steps of:

(a) contacting the HER3 variant polypeptide with a test compound under
conditions favorable to binding;

(b) contacting the HER3 variant polypeptide of step (a) with heregulin under
conditions favorable to binding;

(c) comparing the binding interaction between the HER3 variant polypeptide and
heregulin of step (b) with the binding interaction between HER3 variant polypeptide and
heregulin in the absence of the test compound such that a compound which modulates
the interaction between heregulin and the HER3 variant polypeptide can be identified.

20. A method of inhibiting the interaction between a heregulin polypeptide
having the sequence shown in SEQ ID NO: 4 and a HER3 polypeptide having the
sequence shown in SEQ ID NO: 2 comprising exposing the heregulin polypeptide to a
non-naturally occurring HER3 variant polypeptide comprising amino acid 20 to amino
acid 329 of HER3 polypeptide sequence of SEQ ID NO: 2; wherein the HER3 variant
polypeptide specifically binds to the heregulin polypeptide of SEQ ID NO: 4; and
exhibits an impaired ability to interact with the HER2 polypeptide of SEQ ID NO: 6.

21. The method of claim 20, wherein the HER3 variant polypeptide has one
or more amino acid substitutions at the following residue position(s) in SEQ ID NO:
2: E64 and V110.

22. The method of claim 20, wherein the HER3 variant polypeptide has an
amino acid substitution at V110.

23. The method of claim 20, wherein the amino acid substitution is V91A. 

24. The method of claim 20, wherein the HER3 variant polypeptide is 
conjugated or linked to one or more polyol groups. 

25. The method of claim 24, wherein the polyol group is polyethylene glycol.
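
The residue designations recited in claims 11 to 13 and 21 to 23 can be cross-checked against SEQ ID NO: 2. The following Python sketch is illustrative only and forms no part of the claimed subject matter. It hard-codes the first 112 residues of SEQ ID NO: 2 in one-letter code, as transcribed from the sequence listing, and checks that position 64 is glutamate (E64) and position 110 is valine (V110). The helper names and the 19-residue offset are assumptions: the offset presumes that the mature polypeptide begins at amino acid 20, as the recitation of "amino acid 20 to amino acid 329" suggests, in which case V110 of SEQ ID NO: 2 would be numbered V91 in the mature polypeptide.

```python
# Illustrative sketch only. Helper names and the signal-peptide offset are
# assumptions introduced for this example; they do not come from the source.

# First 112 residues of SEQ ID NO: 2 (one-letter code), transcribed from
# the sequence listing.
SEQ_ID_NO_2_N_TERMINUS = (
    "MRANDALQVLGLLFSL"  # residues 1-16
    "ARGSEVGNSQAVCPGT"  # residues 17-32
    "LNGLSVTGDAENQYQT"  # residues 33-48
    "LYKLYERCEVVMGNLE"  # residues 49-64
    "IVLTGHNADLSFLQWI"  # residues 65-80
    "REVTGYVLVAMNEFST"  # residues 81-96
    "LPLPNLRVVRGTQVYD"  # residues 97-112
)

# Assumption: residues 1-19 are absent from the mature polypeptide.
ASSUMED_SIGNAL_PEPTIDE_LENGTH = 19


def residue_at(sequence: str, position: int) -> str:
    """Return the one-letter residue at a 1-based position."""
    if not 1 <= position <= len(sequence):
        raise ValueError(f"position {position} is outside 1..{len(sequence)}")
    return sequence[position - 1]


def substitution_label(sequence: str, position: int, new_residue: str,
                       offset: int = 0) -> str:
    """Build a label such as 'V110A'; `offset` shifts the printed number."""
    return f"{residue_at(sequence, position)}{position - offset}{new_residue}"


if __name__ == "__main__":
    print(residue_at(SEQ_ID_NO_2_N_TERMINUS, 64))    # E
    print(residue_at(SEQ_ID_NO_2_N_TERMINUS, 110))   # V
    print(substitution_label(SEQ_ID_NO_2_N_TERMINUS, 110, "A"))  # V110A
    print(substitution_label(SEQ_ID_NO_2_N_TERMINUS, 110, "A",
                             offset=ASSUMED_SIGNAL_PEPTIDE_LENGTH))  # V91A
```

Under the stated offset assumption, the alanine substitution at V110 and the substitution designated V91A refer to the same residue counted from two different starting points.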

SEQUENCE LISTING

<110> The Regents of the University of California et al.

<120> METHODS AND MATERIALS FOR CHARACTERIZING AND MODULATING INTERACTIONS BETWEEN HEREGULIN AND HER3



<130> 30448103WOU1 

<150> 60/308,431 
<151> 2001-07-27 

<160> 24 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 4026 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (1)...(4026)
<400> 1 

atg agg gcg aac gac gct ctg cag gtg ctg ggc ttg ctt ttc agc ctg 48
Met Arg Ala Asn Asp Ala Leu Gln Val Leu Gly Leu Leu Phe Ser Leu
1 5 10 15

gcc cgg ggc tcc gag gtg ggc aac tct cag gca gtg tgt cct ggg act 96
Ala Arg Gly Ser Glu Val Gly Asn Ser Gln Ala Val Cys Pro Gly Thr
20 25 30

ctg aat ggc ctg agt gtg acc ggc gat gct gag aac caa tac cag aca 144
Leu Asn Gly Leu Ser Val Thr Gly Asp Ala Glu Asn Gln Tyr Gln Thr
35 40 45

ctg tac aag ctc tac gag agg tgt gag gtg gtg atg ggg aac ctt gag 192
Leu Tyr Lys Leu Tyr Glu Arg Cys Glu Val Val Met Gly Asn Leu Glu
50 55 60

att gtg ctc acg gga cac aat gcc gac ctc tcc ttc ctg cag tgg att 240
Ile Val Leu Thr Gly His Asn Ala Asp Leu Ser Phe Leu Gln Trp Ile
65 70 75 80

cga gaa gtg aca ggc tat gtc ctc gtg gcc atg aat gaa ttc tct act 288
Arg Glu Val Thr Gly Tyr Val Leu Val Ala Met Asn Glu Phe Ser Thr
85 90 95

cta cca ttg ccc aac ctc cgc gtg gtg cga ggg acc cag gtc tac gat 336
Leu Pro Leu Pro Asn Leu Arg Val Val Arg Gly Thr Gln Val Tyr Asp
100 105 110

ggg aag ttt gcc atc ttc gtc atg ttg aac tat aac acc aac tcc agc 384
Gly Lys Phe Ala Ile Phe Val Met Leu Asn Tyr Asn Thr Asn Ser Ser
115 120 125

cac gct ctg cgc cag ctc cgc ttg act cag ctc acc gag att ctg tca 432
His Ala Leu Arg Gln Leu Arg Leu Thr Gln Leu Thr Glu Ile Leu Ser
130 135 140

ggg ggt gtt tat att gag aag aac gat aag ctt tgt cac atg gac aca 480
Gly Gly Val Tyr Ile Glu Lys Asn Asp Lys Leu Cys His Met Asp Thr
145 150 155 160

att gac tgg agg gac atc gtg agg gac cga gat gct gag ata gtg gtg 528
Ile Asp Trp Arg Asp Ile Val Arg Asp Arg Asp Ala Glu Ile Val Val
165 170 175

aag gac aat ggc aga agc tgt ccc ccc tgt cat gag gtt tgc aag ggg 576
Lys Asp Asn Gly Arg Ser Cys Pro Pro Cys His Glu Val Cys Lys Gly
180 185 190

cga tgc tgg ggt cct gga tca gaa gac tgc cag aca ttg acc aag acc 624
Arg Cys Trp Gly Pro Gly Ser Glu Asp Cys Gln Thr Leu Thr Lys Thr
195 200 205

atc tgt gct cct cag tgt aat ggt cac tgc ttt ggg ccc aac ccc aac 672
Ile Cys Ala Pro Gln Cys Asn Gly His Cys Phe Gly Pro Asn Pro Asn
210 215 220

cag tgc tgc cat gat gag tgt gcc ggg ggc tgc tca ggc cct cag gac 720
Gln Cys Cys His Asp Glu Cys Ala Gly Gly Cys Ser Gly Pro Gln Asp
225 230 235 240

aca gac tgc ttt gcc tgc cgg cac ttc aat gac agt gga gcc tgt gta 768
Thr Asp Cys Phe Ala Cys Arg His Phe Asn Asp Ser Gly Ala Cys Val
245 250 255

cct cgc tgt cca cag cct ctt gtc tac aac aag cta act ttc cag ctg 816
Pro Arg Cys Pro Gln Pro Leu Val Tyr Asn Lys Leu Thr Phe Gln Leu
260 265 270

gaa ccc aat ccc cac acc aag tat cag tat gga gga gtt tgt gta gcc 864
Glu Pro Asn Pro His Thr Lys Tyr Gln Tyr Gly Gly Val Cys Val Ala
275 280 285

agc tgt ccc cat aac ttt gtg gtg gat caa aca tcc tgt gtc agg gcc 912
Ser Cys Pro His Asn Phe Val Val Asp Gln Thr Ser Cys Val Arg Ala
290 295 300

tgt cct cct gac aag atg gaa gta gat aaa aat ggg ctc aag atg tgt 960
Cys Pro Pro Asp Lys Met Glu Val Asp Lys Asn Gly Leu Lys Met Cys
305 310 315 320

gag cct tgt ggg gga eta tgt ccc aaa gec tgt gag gga aca ggc tct 
1008 

Glu Pro Cys Gly Gly Leu Cys Pro Lys Ala Cys Glu Gly Thr Gly Ser 

325 330 335 

ggg agc cgc ttc cag act gtg gac tcg agc aac att gat gga ttt gtg
1056

Gly Ser Arg Phe Gln Thr Val Asp Ser Ser Asn Ile Asp Gly Phe Val
340 345 350 

aac tgc acc aag ate ctg ggc aac ctg gac ttt ctg ate acc ggc etc 
1104 

Asn Cys Thr Lys lie Leu Gly Asn Leu Asp Phe Leu lie Thr Gly Leu 
355 360 365 

aat gga gac ccc tgg cac aag ate cct gee ctg gac cca gag aag etc 
1152 

Asn Gly Asp Pro Trp His Lys He Pro Ala Leu Asp Pro Glu Lys Leu 
370 375 380 

aat gtc ttc egg aca gta egg gag ate aca ggt tac ctg aac ate cag 
1200 

Asn Val Phe Arg Thr Val Arg Glu He Thr Gly Tyr Leu Asn He Gin 

385 390 395 400 

tec tgg ccg ccc cac atg cac aac ttc agt gtt ttt tec aat ttg aca 
1248 

Ser Trp Pro Pro His Met His Asn Phe Ser Val Phe Ser Asn Leu Thr 

405 410 415 

acc att gga ggc aga age etc tac aac egg ggc ttc tea ttg ttg ate 
1296 

Thr Ile Gly Gly Arg Ser Leu Tyr Asn Arg Gly Phe Ser Leu Leu Ile
420 425 430 

atg aag aac ttg aat gtc aca tct ctg ggc ttc cga tcc ctg aag gaa
1344 

Met Lys Asn Leu Asn Val Thr Ser Leu Gly Phe Arg Ser Leu Lys Glu 

435 440 445 

att agt get ggg cgt ate tat ata agt gec aat agg cag etc tgc tac 
1392 

He Ser Ala Gly Arg He Tyr He Ser Ala Asn Arg Gin Leu Cys Tyr 
450 455 460 

cac cac tct ttg aac tgg acc aag gtg ctt egg ggg cct acg gaa gag 
1440 

His His Ser Leu Asn Trp Thr Lys Val Leu Arg Gly Pro Thr Glu Glu 
465 470 475 480 
cga cta gac atc aag cat aat cgg ccg cgc aga gac tgc gtg gca gag 1488
Arg Leu Asp Ile Lys His Asn Arg Pro Arg Arg Asp Cys Val Ala Glu
485 490 495

ggc aaa gtg tgt gac cca ctg tgc tcc tct ggg gga tgc tgg ggc cca 1536
Gly Lys Val Cys Asp Pro Leu Cys Ser Ser Gly Gly Cys Trp Gly Pro
500 505 510

ggc cct ggt cag tgc ttg tcc tgt cga aat tat agc cga gga ggt gtc 1584
Gly Pro Gly Gln Cys Leu Ser Cys Arg Asn Tyr Ser Arg Gly Gly Val
515 520 525

tgt gtg acc cac tgc aac ttt ctg aat ggg gag cct cga gaa ttt gcc 1632
Cys Val Thr His Cys Asn Phe Leu Asn Gly Glu Pro Arg Glu Phe Ala
530 535 540

cat gag gcc gaa tgc ttc tcc tgc cac ccg gaa tgc caa ccc atg ggg 1680
His Glu Ala Glu Cys Phe Ser Cys His Pro Glu Cys Gln Pro Met Gly
545 550 555 560

ggc act gcc aca tgc aat ggc tcg ggc tct gat act tgt gct caa tgt 1728
Gly Thr Ala Thr Cys Asn Gly Ser Gly Ser Asp Thr Cys Ala Gln Cys
565 570 575

gcc cat ttt cga gat ggg ccc cac tgt gtg agc agc tgc ccc cat gga 1776
Ala His Phe Arg Asp Gly Pro His Cys Val Ser Ser Cys Pro His Gly
580 585 590

gtc cta ggt gcc aag ggc cca atc tac aag tac cca gat gtt cag aat 1824
Val Leu Gly Ala Lys Gly Pro Ile Tyr Lys Tyr Pro Asp Val Gln Asn
595 600 605

gaa tgt cgg ccc tgc cat gag aac tgc acc cag ggg tgt aaa gga cca 1872
Glu Cys Arg Pro Cys His Glu Asn Cys Thr Gln Gly Cys Lys Gly Pro
610 615 620

gag ctt caa gac tgt tta gga caa aca ctg gtg ctg atc ggc aaa acc 1920
Glu Leu Gln Asp Cys Leu Gly Gln Thr Leu Val Leu Ile Gly Lys Thr
625 630 635 640

cat ctg aca atg gct ttg aca gtg ata gca gga ttg gta gtg att ttc 1968
His Leu Thr Met Ala Leu Thr Val Ile Ala Gly Leu Val Val Ile Phe
645 650 655

atg atg ctg ggc ggc act ttt ctc tac tgg cgt ggg cgc cgg att cag 2016
Met Met Leu Gly Gly Thr Phe Leu Tyr Trp Arg Gly Arg Arg Ile Gln
660 665 670

aat aaa agg gct atg agg cga tac ttg gaa cgg ggt gag agc ata gag 2064
Asn Lys Arg Ala Met Arg Arg Tyr Leu Glu Arg Gly Glu Ser Ile Glu
675 680 685

cct ctg gac ccc agt gag aag gct aac aaa gtc ttg gcc aga atc ttc 2112
Pro Leu Asp Pro Ser Glu Lys Ala Asn Lys Val Leu Ala Arg Ile Phe
690 695 700

aaa gag aca gag cta agg aag ctt aaa gtg ctt ggc tcg ggt gtc ttt 2160
Lys Glu Thr Glu Leu Arg Lys Leu Lys Val Leu Gly Ser Gly Val Phe
705 710 715 720

gga act gtg cac aaa gga gtg tgg atc cct gag ggt gaa tca atc aag 2208
Gly Thr Val His Lys Gly Val Trp Ile Pro Glu Gly Glu Ser Ile Lys
725 730 735

att cca gtc tgc att aaa gtc att gag gac aag agt gga cgg cag agt 2256
Ile Pro Val Cys Ile Lys Val Ile Glu Asp Lys Ser Gly Arg Gln Ser
740 745 750

ttt caa gct gtg aca gat cat atg ctg gcc att ggc agc ctg gac cat 2304
Phe Gln Ala Val Thr Asp His Met Leu Ala Ile Gly Ser Leu Asp His
755 760 765

gcc cac att gta agg ctg ctg gga cta tgc cca ggg tca tct ctg cag 2352
Ala His Ile Val Arg Leu Leu Gly Leu Cys Pro Gly Ser Ser Leu Gln
770 775 780

ctt gtc act caa tat ttg cct ctg ggt tct ctg ctg gat cat gtg aga 2400
Leu Val Thr Gln Tyr Leu Pro Leu Gly Ser Leu Leu Asp His Val Arg
785 790 795 800

caa cac cgg ggg gca ctg ggg cca cag ctg ctg ctc aac tgg gga gta 2448
Gln His Arg Gly Ala Leu Gly Pro Gln Leu Leu Leu Asn Trp Gly Val
805 810 815






caa att gcc aag gga atg tac tac ctt gag gaa cat ggt atg gtg cat 
2496 

Gin lie Ala Lys Gly Met Tyr Tyr Leu Glu Glu His Gly Met Val His 
820 825 830 

aga aac ctg get gcc cga aac gtg eta etc aag tea ccc agt cag gtt 
2544 

Arg Asn Leu Ala Ala Arg Asn Val Leu Leu Lys Ser Pro Ser Gin Val 
835 840 845 

cag gtg gca gat ttt ggt gtg get gac ctg ctg cct cct gat gat aag 
2592 

Gin Val Ala Asp Phe Gly Val Ala Asp Leu Leu Pro Pro Asp Asp Lys 

850 855 860 

cag ctg eta tac agt gag gcc aag act cca att aag tgg atg gcc ctt 
2640 

Gin Leu Leu Tyr Ser Glu Ala Lys Thr Pro lie Lys Trp Met Ala Leu 
865 870 875 880 

gag agt ate cac ttt ggg aaa tac aca cac cag agt gat gtc tgg age 
2688 

Glu Ser lie His Phe Gly Lys Tyr Thr His Gin Ser Asp Val Trp Ser 
885 890 895 

tat ggt gtg aca gtt tgg gag ttg atg ace ttc ggg gca gag ccc tat 
2736 

Tyr Gly Val Thr Val Trp Glu Leu Met Thr Phe Gly Ala Glu Pro Tyr 
900 905 910 

gca ggg eta cga ttg get gaa gta cca gac ctg eta gag aag ggg gag 
2784 

Ala Gly Leu Arg Leu Ala Glu Val Pro Asp Leu Leu Glu Lys Gly Glu 
915 920 925 

egg ttg gca cag ccc cag ate tgc aca att gat gtc tac atg gtg atg 
2832 

Arg Leu Ala Gin Pro Gin lie Cys Thr He Asp Val Tyr Met Val Met 
930 935 940 

gtc aag tgt tgg atg att gat gag aac att cgc cca acc ttt aaa gaa 
2880 

Val Lys Cys Trp Met He Asp Glu Asn He Arg Pro Thr Phe Lys Glu 
945 950 955 960 

eta gcc aat gag ttc acc agg atg gcc cga gac cca cca egg tat ctg 
2928 

Leu Ala Asn Glu Phe Thr Arg Met Ala Arg Asp Pro Pro Arg Tyr Leu 
965 970 975 

gtc ata aag aga gag agt ggg cct gga ata gcc cct ggg cca gag ccc 
2976 

Val He Lys Arg Glu Ser Gly Pro Gly He Ala Pro Gly Pro Glu Pro 
980 985 990

cat ggt ctg aca aac aag aag eta gag gaa gta gag ctg gag cca gaa 
3024 

His Gly Leu Thr Asn Lys Lys Leu Glu Glu Val Glu Leu Glu Pro Glu 
995 1000 1005 

eta gac eta gac eta gac ttg gaa gca gag gag gac aac ctg gca ace 
3072 

Leu Asp Leu Asp Leu Asp Leu Glu Ala Glu Glu Asp Asn Leu Ala Thr 
1010 1015 1020 

acc aca ctg ggc tec gec etc age eta cca gtt gga aca ctt aat egg 
3120 

Thr Thr Leu Gly Ser Ala Leu Ser Leu Pro Val Gly Thr Leu Asn Arg 
1025 1030 1035 1040 

cca cgt ggg age cag age ctt tta agt cca tea tct gga tac atg ccc 
3168 

Pro Arg Gly Ser Gin Ser Leu Leu Ser Pro Ser Ser Gly Tyr Met Pro 
1045 1050 1055 

atg aac cag ggt aat ctt ggg ggg tct tgc cag gag tct gca gtt tct 
3216 

Met Asn Gin Gly Asn Leu Gly Gly Ser Cys Gin Glu Ser Ala Val Ser 
1060 1065 1070 

ggg age agt gaa egg tgc ccc cgt cca gtc tct eta cac cca atg cca 
3264 

Gly Ser Ser Glu Arg Cys Pro Arg Pro Val Ser Leu His Pro Met Pro 
1075 1080 1085 

egg gga tgc ctg gca tea gag tea tea gag ggg cat gta aca ggc tct 
3312 

Arg Gly Cys Leu Ala Ser Glu Ser Ser Glu Gly His Val Thr Gly Ser 
1090 1095 1100 

gag get gag etc cag gag aaa gtg tea atg tgt aga age egg age agg 
3360 

Glu Ala Glu Leu Gin Glu Lys Val Ser Met Cys Arg Ser Arg Ser Arg 

1105 1110 1115 1120 

age egg age cca egg cca cgc gga gat age gee tac cat tec cag cgc 
3408 

Ser Arg Ser Pro Arg Pro Arg Gly Asp Ser Ala Tyr His Ser Gin Arg 
1125 1130 1135 

cac agt ctg ctg act cct gtt acc cca etc tec cca ccc ggg tta gag 
3456 

His Ser Leu Leu Thr Pro Val Thr Pro Leu Ser Pro Pro Gly Leu Glu 
1140 1145 1150 
gaa gag gat gtc aac ggt tat gtc atg cca gat aca cac ctc aaa ggt
3504 

Glu Glu Asp Val Asn Gly Tyr Val Met Pro Asp Thr His Leu Lys Gly 
1155 1160 1165 

act ccc tec tec egg gaa ggc acc ctt tct tea gtg ggt etc agt tct 
3552 

Thr Pro Ser Ser Arg Glu Gly Thr Leu Ser Ser Val Gly Leu Ser Ser 

1170 1175 1180 

gtc ctg ggt act gaa gaa gaa gat gaa gat gag gag tat gaa tac atg 
3600 

Val Leu Gly Thr Glu Glu Glu Asp Glu Asp Glu Glu Tyr Glu Tyr Met 
1185 1190 1195 1200 

aac egg agg aga agg cac agt cca cct cat ccc cct agg cca agt tec 
3648 

Asn Arg Arg Arg Arg His Ser Pro Pro His Pro Pro Arg Pro Ser Ser 
1205 1210 1215 

ctt gag gag ctg ggt tat gag tac atg gat gtg ggg tea gac etc agt 
3696 

Leu Glu Glu Leu Gly Tyr Glu Tyr Met Asp Val Gly Ser Asp Leu Ser 
1220 1225 1230 

gee tct ctg ggc age aca cag agt tgc cca etc cac cct gta ccc ate 
3744 

Ala Ser Leu Gly Ser Thr Gin Ser Cys Pro Leu His Pro Val Pro He 

1235 1240 1245 

atg ccc act gca ggc aca act cca gat gaa gac tat gaa tat atg aat 
3792 

Met Pro Thr Ala Gly Thr Thr Pro Asp Glu Asp Tyr Glu Tyr Met Asn 
1250 1255 1260 

egg caa cga gat gga ggt ggt cct ggg ggt gat tat gca gee atg ggg 
3840 

Arg Gin Arg Asp Gly Gly Gly Pro Gly Gly Asp Tyr Ala Ala Met Gly 
1265 1270 1275 1280 

gec tgc cca gca tct gag caa ggg tat gaa gag atg aga get ttt cag 
3888 

Ala Cys Pro Ala Ser Glu Gin Gly Tyr Glu Glu Met Arg Ala Phe Gin 
1285 1290 1295 

ggg cct gga cat cag gec ccc cat gtc cat tat gee cgc eta aaa act 
3936 

Gly Pro Gly His Gin Ala Pro His Val His Tyr Ala Arg Leu Lys Thr 
1300 1305 1310 

eta cgt age tta gag get aca gac tct gee ttt gat aac cct gat tac 
3984 

Leu Arg Ser Leu Glu Ala Thr Asp Ser Ala Phe Asp Asn Pro Asp Tyr
1315 1320 1325

tgg cat age agg ctt ttc ccc aag get aat gec cag aga acg 
4026 

Trp His Ser Arg Leu Phe Pro Lys Ala Asn Ala Gin Arg Thr 
1330 1335 1340 



<210> 2 

<211> 1342 

<212> PRT 

<213> Homo sapiens 



<400> 2 



Met Arg Ala Asn Asp Ala Leu Gln Val Leu Gly Leu Leu Phe Ser Leu
1 5 10 15

Ala Arg Gly Ser Glu Val Gly Asn Ser Gln Ala Val Cys Pro Gly Thr
20 25 30

Leu Asn Gly Leu Ser Val Thr Gly Asp Ala Glu Asn Gln Tyr Gln Thr
35 40 45

Leu Tyr Lys Leu Tyr Glu Arg Cys Glu Val Val Met Gly Asn Leu Glu
50 55 60

Ile Val Leu Thr Gly His Asn Ala Asp Leu Ser Phe Leu Gln Trp Ile
65 70 75 80

Arg Glu Val Thr Gly Tyr Val Leu Val Ala Met Asn Glu Phe Ser Thr
85 90 95

Leu Pro Leu Pro Asn Leu Arg Val Val Arg Gly Thr Gln Val Tyr Asp
100 105 110

Gly Lys Phe Ala Ile Phe Val Met Leu Asn Tyr Asn Thr Asn Ser Ser
115 120 125

His Ala Leu Arg Gln Leu Arg Leu Thr Gln Leu Thr Glu Ile Leu Ser
130 135 140

Gly Gly Val Tyr Ile Glu Lys Asn Asp Lys Leu Cys His Met Asp Thr
145 150 155 160

Ile Asp Trp Arg Asp Ile Val Arg Asp Arg Asp Ala Glu Ile Val Val
165 170 175

Lys Asp Asn Gly Arg Ser Cys Pro Pro Cys His Glu Val Cys Lys Gly
180 185 190

Arg Cys Trp Gly Pro Gly Ser Glu Asp Cys Gln Thr Leu Thr Lys Thr
195 200 205

Ile Cys Ala Pro Gln Cys Asn Gly His Cys Phe Gly Pro Asn Pro Asn
210 215 220

Gln Cys Cys His Asp Glu Cys Ala Gly Gly Cys Ser Gly Pro Gln Asp
225 230 235 240

Thr Asp Cys Phe Ala Cys Arg His Phe Asn Asp Ser Gly Ala Cys Val
245 250 255

Pro Arg Cys Pro Gln Pro Leu Val Tyr Asn Lys Leu Thr Phe Gln Leu
260 265 270

Glu Pro Asn Pro His Thr Lys Tyr Gln Tyr Gly Gly Val Cys Val Ala
275 280 285

Ser Cys Pro His Asn Phe Val Val Asp Gln Thr Ser Cys Val Arg Ala
290 295 300

Cys Pro Pro Asp Lys Met Glu Val Asp Lys Asn Gly Leu Lys Met Cys
305 310 315 320

Glu Pro Cys Gly Gly Leu Cys Pro Lys Ala Cys Glu Gly Thr Gly Ser
325 330 335

Gly Ser Arg Phe Gln Thr Val Asp Ser Ser Asn Ile Asp Gly Phe Val
340 345 350

Asn Cys Thr Lys Ile Leu Gly Asn Leu Asp Phe Leu Ile Thr Gly Leu
355 360 365

Asn Gly Asp Pro Trp His Lys Ile Pro Ala Leu Asp Pro Glu Lys Leu
370 375 380

Asn Val Phe Arg Thr Val Arg Glu Ile Thr Gly Tyr Leu Asn Ile Gln
385 390 395 400

Ser Trp Pro Pro His Met His Asn Phe Ser Val Phe Ser Asn Leu Thr
405 410 415

Thr Ile Gly Gly Arg Ser Leu Tyr Asn Arg Gly Phe Ser Leu Leu Ile
420 425 430

Met Lys Asn Leu Asn Val Thr Ser Leu Gly Phe Arg Ser Leu Lys Glu
435 440 445

Ile Ser Ala Gly Arg Ile Tyr Ile Ser Ala Asn Arg Gln Leu Cys Tyr
450 455 460

His His Ser Leu Asn Trp Thr Lys Val Leu Arg Gly Pro Thr Glu Glu
465 470 475 480

Arg Leu Asp Ile Lys His Asn Arg Pro Arg Arg Asp Cys Val Ala Glu
485 490 495

Gly Lys Val Cys Asp Pro Leu Cys Ser Ser Gly Gly Cys Trp Gly Pro
500 505 510

Gly Pro Gly Gln Cys Leu Ser Cys Arg Asn Tyr Ser Arg Gly Gly Val
515 520 525

Cys Val Thr His Cys Asn Phe Leu Asn Gly Glu Pro Arg Glu Phe Ala
530 535 540

His Glu Ala Glu Cys Phe Ser Cys His Pro Glu Cys Gln Pro Met Gly
545 550 555 560

Gly Thr Ala Thr Cys Asn Gly Ser Gly Ser Asp Thr Cys Ala Gln Cys
565 570 575

Ala His Phe Arg Asp Gly Pro His Cys Val Ser Ser Cys Pro His Gly
580 585 590

Val Leu Gly Ala Lys Gly Pro Ile Tyr Lys Tyr Pro Asp Val Gln Asn
595 600 605

Glu Cys Arg Pro Cys His Glu Asn Cys Thr Gln Gly Cys Lys Gly Pro
610 615 620

Glu Leu Gln Asp Cys Leu Gly Gln Thr Leu Val Leu Ile Gly Lys Thr
625 630 635 640

His Leu Thr Met Ala Leu Thr Val Ile Ala Gly Leu Val Val Ile Phe
645 650 655

Met Met Leu Gly Gly Thr Phe Leu Tyr Trp Arg Gly Arg Arg Ile Gln
660 665 670

Asn Lys Arg Ala Met Arg Arg Tyr Leu Glu Arg Gly Glu Ser Ile Glu
675 680 685

Pro Leu Asp Pro Ser Glu Lys Ala Asn Lys Val Leu Ala Arg Ile Phe
690 695 700

Lys Glu Thr Glu Leu Arg Lys Leu Lys Val Leu Gly Ser Gly Val Phe
705 710 715 720

Gly Thr Val His Lys Gly Val Trp Ile Pro Glu Gly Glu Ser Ile Lys
725 730 735

Ile Pro Val Cys Ile Lys Val Ile Glu Asp Lys Ser Gly Arg Gln Ser
740 745 750

Phe Gln Ala Val Thr Asp His Met Leu Ala Ile Gly Ser Leu Asp His
755 760 765

Ala His Ile Val Arg Leu Leu Gly Leu Cys Pro Gly Ser Ser Leu Gln
770 775 780

Leu Val Thr Gln Tyr Leu Pro Leu Gly Ser Leu Leu Asp His Val Arg
785 790 795 800

Gln His Arg Gly Ala Leu Gly Pro Gln Leu Leu Leu Asn Trp Gly Val
805 810 815

Gln Ile Ala Lys Gly Met Tyr Tyr Leu Glu Glu His Gly Met Val His
820 825 830

Arg Asn Leu Ala Ala Arg Asn Val Leu Leu Lys Ser Pro Ser Gln Val
835 840 845

Gln Val Ala Asp Phe Gly Val Ala Asp Leu Leu Pro Pro Asp Asp Lys
850 855 860

Gln Leu Leu Tyr Ser Glu Ala Lys Thr Pro Ile Lys Trp Met Ala Leu
865 870 875 880

Glu Ser Ile His Phe Gly Lys Tyr Thr His Gln Ser Asp Val Trp Ser
885 890 895

Tyr Gly Val Thr Val Trp Glu Leu Met Thr Phe Gly Ala Glu Pro Tyr
900 905 910

Ala Gly Leu Arg Leu Ala Glu Val Pro Asp Leu Leu Glu Lys Gly Glu
915 920 925

Arg Leu Ala Gln Pro Gln Ile Cys Thr Ile Asp Val Tyr Met Val Met
930 935 940

Val Lys Cys Trp Met Ile Asp Glu Asn Ile Arg Pro Thr Phe Lys Glu
945 950 955 960

Leu Ala Asn Glu Phe Thr Arg Met Ala Arg Asp Pro Pro Arg Tyr Leu
965 970 975

Val Ile Lys Arg Glu Ser Gly Pro Gly Ile Ala Pro Gly Pro Glu Pro
980 985 990

His Gly Leu Thr Asn Lys Lys Leu Glu Glu Val Glu Leu Glu Pro Glu
995 1000 1005

Leu Asp Leu Asp Leu Asp Leu Glu Ala Glu Glu Asp Asn Leu Ala Thr
1010 1015 1020

Thr Thr Leu Gly Ser Ala Leu Ser Leu Pro Val Gly Thr Leu Asn Arg
1025 1030 1035 1040

Pro Arg Gly Ser Gln Ser Leu Leu Ser Pro Ser Ser Gly Tyr Met Pro
1045 1050 1055

Met Asn Gln Gly Asn Leu Gly Gly Ser Cys Gln Glu Ser Ala Val Ser
1060 1065 1070

Gly Ser Ser Glu Arg Cys Pro Arg Pro Val Ser Leu His Pro Met Pro
1075 1080 1085

Arg Gly Cys Leu Ala Ser Glu Ser Ser Glu Gly His Val Thr Gly Ser
1090 1095 1100

Glu Ala Glu Leu Gln Glu Lys Val Ser Met Cys Arg Ser Arg Ser Arg
1105 1110 1115 1120

Ser Arg Ser Pro Arg Pro Arg Gly Asp Ser Ala Tyr His Ser Gln Arg
1125 1130 1135

His Ser Leu Leu Thr Pro Val Thr Pro Leu Ser Pro Pro Gly Leu Glu
1140 1145 1150

Glu Glu Asp Val Asn Gly Tyr Val Met Pro Asp Thr His Leu Lys Gly
1155 1160 1165

Thr Pro Ser Ser Arg Glu Gly Thr Leu Ser Ser Val Gly Leu Ser Ser
1170 1175 1180

Val Leu Gly Thr Glu Glu Glu Asp Glu Asp Glu Glu Tyr Glu Tyr Met
1185 1190 1195 1200

Asn Arg Arg Arg Arg His Ser Pro Pro His Pro Pro Arg Pro Ser Ser
1205 1210 1215

Leu Glu Glu Leu Gly Tyr Glu Tyr Met Asp Val Gly Ser Asp Leu Ser
1220 1225 1230

Ala Ser Leu Gly Ser Thr Gln Ser Cys Pro Leu His Pro Val Pro Ile
1235 1240 1245

Met Pro Thr Ala Gly Thr Thr Pro Asp Glu Asp Tyr Glu Tyr Met Asn
1250 1255 1260

Arg Gln Arg Asp Gly Gly Gly Pro Gly Gly Asp Tyr Ala Ala Met Gly
1265 1270 1275 1280

Ala Cys Pro Ala Ser Glu Gln Gly Tyr Glu Glu Met Arg Ala Phe Gln
1285 1290 1295

Gly Pro Gly His Gln Ala Pro His Val His Tyr Ala Arg Leu Lys Thr
1300 1305 1310

Leu Arg Ser Leu Glu Ala Thr Asp Ser Ala Phe Asp Asn Pro Asp Tyr
1315 1320 1325

Trp His Ser Arg Leu Phe Pro Lys Ala Asn Ala Gln Arg Thr
1330 1335 1340



<210> 3 
<211> 1935 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)...(1935)

<400> 3 

atg tcc gag cgc aaa gaa ggc aga ggc aaa ggg aag ggc aag aag aag 48
Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys
1 5 10 15

gag cga ggc tcc ggc aag aag ccg gag tcc gcg gcg ggc agc cag agc 96
Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser Gln Ser
20 25 30



cca gec ttg cct ccc caa ttg aaa gag atg aaa age cag gaa teg get 144 

Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser Ala 

35 40 45 

gca ggt tec aaa eta gtc ctt egg tgt gaa acc agt tct gaa tac tec 192 

Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser 

50 55 60 

tct etc aga ttc aag tgg ttc aag aat ggg aat gaa ttg aat cga aaa 240 
Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys
65 70 75 80 

aac aaa cca caa aat atc aag ata caa aaa aag cca ggg aag tca gaa 288
Asn Lys Pro Gln Asn Ile Lys Ile Gln Lys Lys Pro Gly Lys Ser Glu
85 90 95

ctt cgc att aac aaa gca tea ctg get gat tct gga gag tat atg tgc 336 
Leu Arg He Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys 
100 105 110 

aaa gtg atc agc aaa tta gga aat gac agt gcc tct gcc aat atc acc 384
Lys Val Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn Ile Thr
115 120 125 

ate gtg gaa tea aac gag ate ate act ggt atg cca gec tea act gaa 432 
He Val Glu Ser Asn Glu He He Thr Gly Met Pro Ala Ser Thr Glu 
130 135 140 

gga gca tat gtg tct tea gag tct ccc att aga ata tea gta tec aca 480 
Gly Ala Tyr Val Ser Ser Glu Ser Pro He Arg He Ser Val Ser Thr 
145 150 155 160 

gaa gga gca aat act tct tea tct aca tct aca tec acc act ggg aca 528 
Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr 
165 170 175 

age cat ctt gta aaa tgt gcg gag aag gag aaa act ttc tgt gtg aat 576 
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
180 185 190 

gga ggg gag tgc ttc atg gtg aaa gac ctt tca aac ccc tcg aga tac 624
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 
195 200 205 

ttg tgc aag tgc cca aat gag ttt act ggt gat cgc tgc caa aac tac 672 
Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr 
210 215 220 

gta atg gcc agc ttc tac aag cat ctt ggg att gaa ttt atg gag gcg 720
Val Met Ala Ser Phe Tyr Lys His Leu Gly Ile Glu Phe Met Glu Ala
225 230 235 240 

gag gag ctg tac cag aag aga gtg ctg acc ata acc ggc ate tgc ate 768 
Glu Glu Leu Tyr Gin Lys Arg Val Leu Thr He Thr Gly He Cys He 
245 250 255 

gec etc ctt gtg gtc ggc ate atg tgt gtg gtg gee tac tgc aaa acc 816 
Ala Leu Leu Val Val Gly He Met Cys Val Val Ala Tyr Cys Lys Thr 
260 265 270 

aag aaa cag egg aaa aag ctg cat gac cgt ctt egg cag age ctt egg 864 
Lys Lys Gin Arg Lys Lys Leu His Asp Arg Leu Arg Gin Ser Leu Arg 
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275 280 285 

tct gaa cga aac aat atg atg aac att gcc aat ggg cct cac cat cct 912
Ser Glu Arg Asn Asn Met Met Asn Ile Ala Asn Gly Pro His His Pro
290 295 300

aac cca ccc ccc gag aat gtc cag ctg gtg aat caa tac gta tct aaa 960
Asn Pro Pro Pro Glu Asn Val Gln Leu Val Asn Gln Tyr Val Ser Lys
305 310 315 320

aac gtc atc tcc agt gag cat att gtt gag aga gaa gca gag aca tcc 1008
Asn Val Ile Ser Ser Glu His Ile Val Glu Arg Glu Ala Glu Thr Ser
325 330 335

ttt tcc acc agt cac tat act tcc aca gcc cat cac tcc act act gtc 1056
Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val
340 345 350

acc cag act cct agc cac agc tgg agc aac gga cac act gaa agc atc 1104
Thr Gln Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu Ser Ile
355 360 365

ctt tcc gaa agc cac tct gta atc gtg atg tca tcc gta gaa aac agt 1152
Leu Ser Glu Ser His Ser Val Ile Val Met Ser Ser Val Glu Asn Ser
370 375 380

agg cac agc agc cca act ggg ggc cca aga gga cgt ctt aat ggc aca 1200
Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr
385 390 395 400

gga ggc cct cgt gaa tgt aac agc ttc ctc agg cat gcc aga gaa acc 1248
Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala Arg Glu Thr
405 410 415

cct gat tcc tac cga gac tct cct cat agt gaa agg tat gtg tca gcc 1296
Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala
420 425 430

atg acc acc ccg gct cgt atg tca cct gta gat ttc cac acg cca agc 1344
Met Thr Thr Pro Ala Arg Met Ser Pro Val Asp Phe His Thr Pro Ser
435 440 445

tcc ccc aaa tcg ccc cct tcg gaa atg tct cca ccc gtg tcc agc atg 1392
Ser Pro Lys Ser Pro Pro Ser Glu Met Ser Pro Pro Val Ser Ser Met
450 455 460

acg gtg tcc atg cct tcc atg gcg gtc agc ccc ttc atg gaa gaa gag 1440
Thr Val Ser Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu
465 470 475 480

aga cct cta ctt ctc gtg aca cca cca agg ctg cgg gag aag aag ttt 1488
Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu Lys Lys Phe
485 490 495

gac cat cac cct cag cag ttc agc tcc ttc cac cac aac ccc gcg cat 1536
Asp His His Pro Gln Gln Phe Ser Ser Phe His His Asn Pro Ala His
500 505 510

gac agt aac agc ctc cct gct agc ccc ttg agg ata gtg gag gat gag 1584
Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg Ile Val Glu Asp Glu
515 520 525

gag tat gaa acg acc caa gag tac gag cca gcc caa gag cct gtt aag 1632
Glu Tyr Glu Thr Thr Gln Glu Tyr Glu Pro Ala Gln Glu Pro Val Lys
530 535 540

aaa ctc gcc aat agc cgg cgg gcc aaa aga acc aag ccc aat ggc cac 1680
Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His
545 550 555 560

att gct aac aga ttg gaa gtg gac agc aac aca agc tcc cag agc agt 1728
Ile Ala Asn Arg Leu Glu Val Asp Ser Asn Thr Ser Ser Gln Ser Ser
565 570 575

aac tca gag agt gaa aca gaa gat gaa aga gta ggt gaa gat acg cct 1776
Asn Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro
580 585 590

ttc ctg ggc ata cag aac ccc ctg gca gcc agt ctt gag gca aca cct 1824
Phe Leu Gly Ile Gln Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro
595 600 605

gcc ttc cgc ctg gct gac agc agg act aac cca gca ggc cgc ttc tcg 1872
Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe Ser
610 615 620

aca cag gaa gaa atc cag gcc agg ctg tct agt gta att gct aac caa 1920
Thr Gln Glu Glu Ile Gln Ala Arg Leu Ser Ser Val Ile Ala Asn Gln
625 630 635 640

gac cct att gct gta 1935
Asp Pro Ile Ala Val
645

<210> 4
<211> 645
<212> PRT
<213> Homo sapiens
<400> 4

Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys
1 5 10 15

Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser Gln Ser
20 25 30

Pro Ala Leu Pro Pro Gln Leu Lys Glu Met Lys Ser Gln Glu Ser Ala
35 40 45

Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser
50 55 60

Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys
65 70 75 80

Asn Lys Pro Gln Asn Ile Lys Ile Gln Lys Lys Pro Gly Lys Ser Glu
85 90 95

Leu Arg Ile Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys
100 105 110

Lys Val Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn Ile Thr
115 120 125

Ile Val Glu Ser Asn Glu Ile Ile Thr Gly Met Pro Ala Ser Thr Glu
130 135 140

Gly Ala Tyr Val Ser Ser Glu Ser Pro Ile Arg Ile Ser Val Ser Thr
145 150 155 160

Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr
165 170 175

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn
180 185 190

Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr
195 200 205

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr
210 215 220

Val Met Ala Ser Phe Tyr Lys His Leu Gly Ile Glu Phe Met Glu Ala
225 230 235 240

Glu Glu Leu Tyr Gln Lys Arg Val Leu Thr Ile Thr Gly Ile Cys Ile
245 250 255

Ala Leu Leu Val Val Gly Ile Met Cys Val Val Ala Tyr Cys Lys Thr
260 265 270

Lys Lys Gln Arg Lys Lys Leu His Asp Arg Leu Arg Gln Ser Leu Arg
275 280 285

Ser Glu Arg Asn Asn Met Met Asn Ile Ala Asn Gly Pro His His Pro
290 295 300

Asn Pro Pro Pro Glu Asn Val Gln Leu Val Asn Gln Tyr Val Ser Lys
305 310 315 320

Asn Val Ile Ser Ser Glu His Ile Val Glu Arg Glu Ala Glu Thr Ser
325 330 335

Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val
340 345 350

Thr Gln Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu Ser Ile
355 360 365

Leu Ser Glu Ser His Ser Val Ile Val Met Ser Ser Val Glu Asn Ser
370 375 380

Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr
385 390 395 400

Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala Arg Glu Thr
405 410 415

Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala
420 425 430

Met Thr Thr Pro Ala Arg Met Ser Pro Val Asp Phe His Thr Pro Ser
435 440 445

Ser Pro Lys Ser Pro Pro Ser Glu Met Ser Pro Pro Val Ser Ser Met
450 455 460

Thr Val Ser Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu
465 470 475 480

Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu Lys Lys Phe
485 490 495

Asp His His Pro Gln Gln Phe Ser Ser Phe His His Asn Pro Ala His
500 505 510

Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg Ile Val Glu Asp Glu
515 520 525

Glu Tyr Glu Thr Thr Gln Glu Tyr Glu Pro Ala Gln Glu Pro Val Lys
530 535 540

Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His
545 550 555 560

Ile Ala Asn Arg Leu Glu Val Asp Ser Asn Thr Ser Ser Gln Ser Ser
565 570 575

Asn Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro
580 585 590

Phe Leu Gly Ile Gln Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro
595 600 605

Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe Ser
610 615 620

Thr Gln Glu Glu Ile Gln Ala Arg Leu Ser Ser Val Ile Ala Asn Gln
625 630 635 640

Asp Pro Ile Ala Val
645

<210> 5
<211> 3765
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1)...(3765)
<400> 5

atg gag ctg gcg gcc ttg tgc cgc tgg ggg ctc ctc ctc gcc ctc ttg 48
Met Glu Leu Ala Ala Leu Cys Arg Trp Gly Leu Leu Leu Ala Leu Leu
1 5 10 15

ccc ccc gga gcc gcg agc acc caa gtg tgc acc ggc aca gac atg aag 96
Pro Pro Gly Ala Ala Ser Thr Gln Val Cys Thr Gly Thr Asp Met Lys
20 25 30

ctg cgg ctc cct gcc agt ccc gag acc cac ctg gac atg ctc cgc cac 144
Leu Arg Leu Pro Ala Ser Pro Glu Thr His Leu Asp Met Leu Arg His
35 40 45

ctc tac cag ggc tgc cag gtg gtg cag gga aac ctg gaa ctc acc tac 192
Leu Tyr Gln Gly Cys Gln Val Val Gln Gly Asn Leu Glu Leu Thr Tyr
50 55 60

ctg ccc acc aat gcc agc ctg tcc ttc ctg cag gat atc cag gag gtg 240
Leu Pro Thr Asn Ala Ser Leu Ser Phe Leu Gln Asp Ile Gln Glu Val
65 70 75 80

cag ggc tac gtg ctc atc gct cac aac caa gtg agg cag gtc cca ctg 288
Gln Gly Tyr Val Leu Ile Ala His Asn Gln Val Arg Gln Val Pro Leu
85 90 95

cag agg ctg cgg att gtg cga ggc acc cag ctc ttt gag gac aac tat 336
Gln Arg Leu Arg Ile Val Arg Gly Thr Gln Leu Phe Glu Asp Asn Tyr
100 105 110

gcc ctg gcc gtg cta gac aat gga gac ccg ctg aac aat acc acc cct 384
Ala Leu Ala Val Leu Asp Asn Gly Asp Pro Leu Asn Asn Thr Thr Pro
115 120 125

gtc aca ggg gcc tcc cca gga ggc ctg cgg gag ctg cag ctt cga agc 432
Val Thr Gly Ala Ser Pro Gly Gly Leu Arg Glu Leu Gln Leu Arg Ser
130 135 140

ctc aca gag atc ttg aaa gga ggg gtc ttg atc cag cgg aac ccc cag 480
Leu Thr Glu Ile Leu Lys Gly Gly Val Leu Ile Gln Arg Asn Pro Gln
145 150 155 160

ctc tgc tac cag gac acg att ttg tgg aag gac atc ttc cac aag aac 528
Leu Cys Tyr Gln Asp Thr Ile Leu Trp Lys Asp Ile Phe His Lys Asn
165 170 175

aac cag ctg gct ctc aca ctg ata gac acc aac cgc tct cgg gcc tgc 576
Asn Gln Leu Ala Leu Thr Leu Ile Asp Thr Asn Arg Ser Arg Ala Cys
180 185 190

cac ccc tgt tct ccg atg tgt aag ggc tcc cgc tgc tgg gga gag agt 624
His Pro Cys Ser Pro Met Cys Lys Gly Ser Arg Cys Trp Gly Glu Ser
195 200 205

tct gag gat tgt cag agc ctg acg cgc act gtc tgt gcc ggt ggc tgt 672
Ser Glu Asp Cys Gln Ser Leu Thr Arg Thr Val Cys Ala Gly Gly Cys
210 215 220

gcc cgc tgc aag ggg cca ctg ccc act gac tgc tgc cat gag cag tgt 720
Ala Arg Cys Lys Gly Pro Leu Pro Thr Asp Cys Cys His Glu Gln Cys
225 230 235 240

gct gcc ggc tgc acg ggc ccc aag cac tct gac tgc ctg gcc tgc ctc 768
Ala Ala Gly Cys Thr Gly Pro Lys His Ser Asp Cys Leu Ala Cys Leu
245 250 255

cac ttc aac cac agt ggc atc tgt gag ctg cac tgc cca gcc ctg gtc 816
His Phe Asn His Ser Gly Ile Cys Glu Leu His Cys Pro Ala Leu Val
260 265 270

acc tac aac aca gac acg ttt gag tcc atg ccc aat ccc gag ggc cgg 864
Thr Tyr Asn Thr Asp Thr Phe Glu Ser Met Pro Asn Pro Glu Gly Arg
275 280 285

tat aca ttc ggc gcc agc tgt gtg act gcc tgt ccc tac aac tac ctt 912
Tyr Thr Phe Gly Ala Ser Cys Val Thr Ala Cys Pro Tyr Asn Tyr Leu
290 295 300

tct acg gac gtg gga tcc tgc acc ctc gtc tgc ccc ctg cac aac caa 960
Ser Thr Asp Val Gly Ser Cys Thr Leu Val Cys Pro Leu His Asn Gln
305 310 315 320

gag gtg aca gca gag gat gga aca cag cgg tgt gag aag tgc agc aag 1008
Glu Val Thr Ala Glu Asp Gly Thr Gln Arg Cys Glu Lys Cys Ser Lys
325 330 335

ccc tgt gcc cga gtg tgc tat ggt ctg ggc atg gag cac ttg cga gag 1056
Pro Cys Ala Arg Val Cys Tyr Gly Leu Gly Met Glu His Leu Arg Glu
340 345 350

gtg agg gca gtt acc agt gcc aat atc cag gag ttt gct ggc tgc aag 1104
Val Arg Ala Val Thr Ser Ala Asn Ile Gln Glu Phe Ala Gly Cys Lys
355 360 365

aag atc ttt ggg agc ctg gca ttt ctg ccg gag agc ttt gat ggg gac 1152
Lys Ile Phe Gly Ser Leu Ala Phe Leu Pro Glu Ser Phe Asp Gly Asp
370 375 380

cca gcc tcc aac act gcc ccg ctc cag cca gag cag ctc caa gtg ttt 1200
Pro Ala Ser Asn Thr Ala Pro Leu Gln Pro Glu Gln Leu Gln Val Phe
385 390 395 400

gag act ctg gaa gag atc aca ggt tac cta tac atc tca gca tgg ccg 1248
Glu Thr Leu Glu Glu Ile Thr Gly Tyr Leu Tyr Ile Ser Ala Trp Pro
405 410 415

gac agc ctg cct gac ctc agc gtc ttc cag aac ctg caa gta atc cgg 1296
Asp Ser Leu Pro Asp Leu Ser Val Phe Gln Asn Leu Gln Val Ile Arg
420 425 430

gga cga att ctg cac aat ggc gcc tac tcg ctg acc ctg caa ggg ctg 1344
Gly Arg Ile Leu His Asn Gly Ala Tyr Ser Leu Thr Leu Gln Gly Leu
435 440 445

ggc atc agc tgg ctg ggg ctg cgc tca ctg agg gaa ctg ggc agt gga 1392
Gly Ile Ser Trp Leu Gly Leu Arg Ser Leu Arg Glu Leu Gly Ser Gly
450 455 460

ctg gcc ctc atc cac cat aac acc cac ctc tgc ttc gtg cac acg gtg 1440
Leu Ala Leu Ile His His Asn Thr His Leu Cys Phe Val His Thr Val
465 470 475 480

ccc tgg gac cag ctc ttt cgg aac ccg cac caa gct ctg ctc cac act 1488
Pro Trp Asp Gln Leu Phe Arg Asn Pro His Gln Ala Leu Leu His Thr
485 490 495

gcc aac cgg cca gag gac gag tgt gtg ggc gag ggc ctg gcc tgc cac 1536
Ala Asn Arg Pro Glu Asp Glu Cys Val Gly Glu Gly Leu Ala Cys His
500 505 510

cag ctg tgc gcc cga ggg cac tgc tgg ggt cca ggg ccc acc cag tgt 1584
Gln Leu Cys Ala Arg Gly His Cys Trp Gly Pro Gly Pro Thr Gln Cys
515 520 525

gtc aac tgc agc cag ttc ctt cgg ggc cag gag tgc gtg gag gaa tgc 1632
Val Asn Cys Ser Gln Phe Leu Arg Gly Gln Glu Cys Val Glu Glu Cys
530 535 540

cga gta ctg cag ggg ctc ccc agg gag tat gtg aat gcc agg cac tgt 1680
Arg Val Leu Gln Gly Leu Pro Arg Glu Tyr Val Asn Ala Arg His Cys
545 550 555 560

ttg ccg tgc cac cct gag tgt cag ccc cag aat ggc tca gtg acc tgt 1728
Leu Pro Cys His Pro Glu Cys Gln Pro Gln Asn Gly Ser Val Thr Cys
565 570 575

ttt gga ccg gag gct gac cag tgt gtg gcc tgt gcc cac tat aag gac 1776
Phe Gly Pro Glu Ala Asp Gln Cys Val Ala Cys Ala His Tyr Lys Asp
580 585 590

cct ccc ttc tgc gtg gcc cgc tgc ccc agc ggt gtg aaa cct gac ctc 1824
Pro Pro Phe Cys Val Ala Arg Cys Pro Ser Gly Val Lys Pro Asp Leu
595 600 605

tcc tac atg ccc atc tgg aag ttt cca gat gag gag ggc gca tgc cag 1872
Ser Tyr Met Pro Ile Trp Lys Phe Pro Asp Glu Glu Gly Ala Cys Gln
610 615 620

cct tgc ccc atc aac tgc acc cac tcc tgt gtg gac ctg gat gac aag 1920
Pro Cys Pro Ile Asn Cys Thr His Ser Cys Val Asp Leu Asp Asp Lys
625 630 635 640

ggc tgc ccc gcc gag cag aga gcc agc cct ctg acg tcc atc gtc tct 1968
Gly Cys Pro Ala Glu Gln Arg Ala Ser Pro Leu Thr Ser Ile Val Ser
645 650 655

gcg gtg gtt ggc att ctg ctg gtc gtg gtc ttg ggg gtg gtc ttt ggg 2016
Ala Val Val Gly Ile Leu Leu Val Val Val Leu Gly Val Val Phe Gly
660 665 670

atc ctc atc aag cga cgg cag cag aag atc cgg aag tac acg atg cgg 2064
Ile Leu Ile Lys Arg Arg Gln Gln Lys Ile Arg Lys Tyr Thr Met Arg
675 680 685

aga ctg ctg cag gaa acg gag ctg gtg gag ccg ctg aca cct agc gga 2112
Arg Leu Leu Gln Glu Thr Glu Leu Val Glu Pro Leu Thr Pro Ser Gly
690 695 700

gcg atg ccc aac cag gcg cag atg cgg atc ctg aaa gag acg gag ctg 2160
Ala Met Pro Asn Gln Ala Gln Met Arg Ile Leu Lys Glu Thr Glu Leu
705 710 715 720

agg aag gtg aag gtg ctt gga tct ggc gct ttt ggc aca gtc tac aag 2208
Arg Lys Val Lys Val Leu Gly Ser Gly Ala Phe Gly Thr Val Tyr Lys
725 730 735

ggc atc tgg atc cct gat ggg gag aat gtg aaa att cca gtg gcc atc 2256
Gly Ile Trp Ile Pro Asp Gly Glu Asn Val Lys Ile Pro Val Ala Ile
740 745 750

aaa gtg ttg agg gaa aac aca tcc ccc aaa gcc aac aaa gaa atc tta 2304
Lys Val Leu Arg Glu Asn Thr Ser Pro Lys Ala Asn Lys Glu Ile Leu
755 760 765

gac gaa gca tac gtg atg gct ggt gtg ggc tcc cca tat gtc tcc cgc 2352
Asp Glu Ala Tyr Val Met Ala Gly Val Gly Ser Pro Tyr Val Ser Arg
770 775 780

ctt ctg ggc atc tgc ctg aca tcc acg gtg cag ctg gtg aca cag ctt 2400
Leu Leu Gly Ile Cys Leu Thr Ser Thr Val Gln Leu Val Thr Gln Leu
785 790 795 800

atg ccc tat ggc tgc ctc tta gac cat gtc cgg gaa aac cgc gga cgc 2448
Met Pro Tyr Gly Cys Leu Leu Asp His Val Arg Glu Asn Arg Gly Arg
805 810 815

ctg ggc tcc cag gac ctg ctg aac tgg tgt atg cag att gcc aag ggg 2496
Leu Gly Ser Gln Asp Leu Leu Asn Trp Cys Met Gln Ile Ala Lys Gly
820 825 830

atg agc tac ctg gag gat gtg cgg ctc gta cac agg gac ttg gcc gct 2544
Met Ser Tyr Leu Glu Asp Val Arg Leu Val His Arg Asp Leu Ala Ala
835 840 845

cgg aac gtg ctg gtc aag agt ccc aac cat gtc aaa att aca gac ttc 2592
Arg Asn Val Leu Val Lys Ser Pro Asn His Val Lys Ile Thr Asp Phe
850 855 860

ggg ctg gct cgg ctg ctg gac att gac gag aca gag tac cat gca gat 2640
Gly Leu Ala Arg Leu Leu Asp Ile Asp Glu Thr Glu Tyr His Ala Asp
865 870 875 880

ggg ggc aag gtg ccc atc aag tgg atg gcg ctg gag tcc att ctc cgc 2688
Gly Gly Lys Val Pro Ile Lys Trp Met Ala Leu Glu Ser Ile Leu Arg
885 890 895

cgg cgg ttc acc cac cag agt gat gtg tgg agt tat ggt gtg act gtg 2736
Arg Arg Phe Thr His Gln Ser Asp Val Trp Ser Tyr Gly Val Thr Val
900 905 910

tgg gag ctg atg act ttt ggg gcc aaa cct tac gat ggg atc cca gcc 2784
Trp Glu Leu Met Thr Phe Gly Ala Lys Pro Tyr Asp Gly Ile Pro Ala
915 920 925

cgg gag atc cct gac ctg ctg gaa aag ggg gag cgg ctg ccc cag ccc 2832
Arg Glu Ile Pro Asp Leu Leu Glu Lys Gly Glu Arg Leu Pro Gln Pro
930 935 940

ccc atc tgc acc att gat gtc tac atg atc atg gtc aaa tgt tgg atg 2880
Pro Ile Cys Thr Ile Asp Val Tyr Met Ile Met Val Lys Cys Trp Met
945 950 955 960

att gac tct gaa tgt cgg cca aga ttc cgg gag ttg gtg tct gaa ttc 2928
Ile Asp Ser Glu Cys Arg Pro Arg Phe Arg Glu Leu Val Ser Glu Phe
965 970 975

tcc cgc atg gcc agg gac ccc cag cgc ttt gtg gtc atc cag aat gag 2976
Ser Arg Met Ala Arg Asp Pro Gln Arg Phe Val Val Ile Gln Asn Glu
980 985 990

gac ttg ggc cca gcc agt ccc ttg gac agc acc ttc tac cgc tca ctg 3024
Asp Leu Gly Pro Ala Ser Pro Leu Asp Ser Thr Phe Tyr Arg Ser Leu
995 1000 1005

ctg gag gac gat gac atg ggg gac ctg gtg gat gct gag gag tat ctg 3072
Leu Glu Asp Asp Asp Met Gly Asp Leu Val Asp Ala Glu Glu Tyr Leu
1010 1015 1020

gta ccc cag cag ggc ttc ttc tgt cca gac cct gcc ccg ggc gct ggg 3120
Val Pro Gln Gln Gly Phe Phe Cys Pro Asp Pro Ala Pro Gly Ala Gly
1025 1030 1035 1040

ggc atg gtc cac cac agg cac cgc agc tca tct acc agg agt ggc ggt 3168
Gly Met Val His His Arg His Arg Ser Ser Ser Thr Arg Ser Gly Gly
1045 1050 1055

ggg gac ctg aca cta ggg ctg gag ccc tct gaa gag gag gcc ccc agg 3216
Gly Asp Leu Thr Leu Gly Leu Glu Pro Ser Glu Glu Glu Ala Pro Arg
1060 1065 1070

tct cca ctg gca ccc tcc gaa ggg gct ggc tcc gat gta ttt gat ggt 3264
Ser Pro Leu Ala Pro Ser Glu Gly Ala Gly Ser Asp Val Phe Asp Gly
1075 1080 1085

gac ctg gga atg ggg gca gcc aag ggg ctg caa agc ctc ccc aca cat 3312
Asp Leu Gly Met Gly Ala Ala Lys Gly Leu Gln Ser Leu Pro Thr His
1090 1095 1100

gac ccc agc cct cta cag cgg tac agt gag gac ccc aca gta ccc ctg 3360
Asp Pro Ser Pro Leu Gln Arg Tyr Ser Glu Asp Pro Thr Val Pro Leu
1105 1110 1115 1120

ccc tct gag act gat ggc tac gtt gcc ccc ctg acc tgc agc ccc cag 3408
Pro Ser Glu Thr Asp Gly Tyr Val Ala Pro Leu Thr Cys Ser Pro Gln
1125 1130 1135

cct gaa tat gtg aac cag cca gat gtt cgg ccc cag ccc cct tcg ccc 3456
Pro Glu Tyr Val Asn Gln Pro Asp Val Arg Pro Gln Pro Pro Ser Pro
1140 1145 1150

cga gag ggc cct ctg cct gct gcc cga cct gct ggt gcc act ctg gaa 3504
Arg Glu Gly Pro Leu Pro Ala Ala Arg Pro Ala Gly Ala Thr Leu Glu
1155 1160 1165

agg gcc aag act ctc tcc cca ggg aag aat ggg gtc gtc aaa gac gtt 3552
Arg Ala Lys Thr Leu Ser Pro Gly Lys Asn Gly Val Val Lys Asp Val
1170 1175 1180

ttt gcc ttt ggg ggt gcc gtg gag aac ccc gag tac ttg aca ccc cag 3600
Phe Ala Phe Gly Gly Ala Val Glu Asn Pro Glu Tyr Leu Thr Pro Gln
1185 1190 1195 1200

gga gga gct gcc cct cag ccc cac cct cct cct gcc ttc agc cca gcc 3648
Gly Gly Ala Ala Pro Gln Pro His Pro Pro Pro Ala Phe Ser Pro Ala
1205 1210 1215

ttc gac aac ctc tat tac tgg gac cag gac cca cca gag cgg ggg gct 3696
Phe Asp Asn Leu Tyr Tyr Trp Asp Gln Asp Pro Pro Glu Arg Gly Ala
1220 1225 1230

cca ccc agc acc ttc aaa ggg aca cct acg gca gag aac cca gag tac 3744
Pro Pro Ser Thr Phe Lys Gly Thr Pro Thr Ala Glu Asn Pro Glu Tyr
1235 1240 1245

ctg ggt ctg gac gtg cca gtg 3765
Leu Gly Leu Asp Val Pro Val
1250 1255



<210> 6 

<211> 1255 

<212> PRT 

<213> Homo sapiens 

<400> 6 



Met Glu Leu Ala Ala Leu Cys Arg Trp Gly Leu Leu Leu Ala Leu Leu
1 5 10 15

Pro Pro Gly Ala Ala Ser Thr Gln Val Cys Thr Gly Thr Asp Met Lys
20 25 30

Leu Arg Leu Pro Ala Ser Pro Glu Thr His Leu Asp Met Leu Arg His
35 40 45

Leu Tyr Gln Gly Cys Gln Val Val Gln Gly Asn Leu Glu Leu Thr Tyr
50 55 60

Leu Pro Thr Asn Ala Ser Leu Ser Phe Leu Gln Asp Ile Gln Glu Val
65 70 75 80

Gln Gly Tyr Val Leu Ile Ala His Asn Gln Val Arg Gln Val Pro Leu
85 90 95

Gln Arg Leu Arg Ile Val Arg Gly Thr Gln Leu Phe Glu Asp Asn Tyr
100 105 110

<210> 7
<211> 30
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 7 

ctagtctcta gatccgaggt gggcaactct 30

<210> 8 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 8 

taccgatcta gatttcggac agagaccccc 30 

<210> 9 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 9 

ctactctcta gatccgaggg caactct 27 